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SYNTHETIC DNA SEQUENCE HAVING ENHANCED 
INSECTICIDAL ACTIVITY IN MAIZE 
This application is a continuation in part application 
of U.S. serial no. 772,027 filed October 4, 1991, which 
disclosure is herein incorporated in its entirety. 

Field of the Invention 
The present invention relates to DNA sequences encoding 
insecticidal proteins, and expression of these sequences in 
plants. 

Background of the Invention 
Expression of the insecticidal protein (IP) genes 
derived from Bacillus thuringiensis (Bt) in plants has proven 
extremely difficult. Attempts have been made to express 
chimeric promoter/Bt IP gene combinations in plants. 
Typically, only low levels of protein have been obtained in 
transgenic plants. See , for example, Vaeck et al., Nature 
328:33-37, 1987; Barton et al., Plant Physiol. 85:1103-1109, 
1987; Fischoff et al., Bio/Technology 5:807-813, 1987. 

One postulated explanation for the cause of low 
expression is that fortuitious transcription processing sites 
produce aberrant forms of Bt IP mRNA transcript. These 
aberrantly processed transcripts are non-functional in a plant, 
in terms of producing an insecticidal protein. Possible 
processing sites include polyadenylation sites, intron splicing 
sites, transcriptional termination signals and transport 
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signals. Most genes do not contain sites that will 
deleteriously affect gene expression in that gene's normal host 
organism. However, the fortuitous occurrence of such 
processing sites in a coding region might complicate the 
expression of that gene in transgenic hosts. For example, the 
coding region for the Bt insecticidal crystal protein gene 
derived from Bacillus ttmrinoiwii. strain Jcurstaki (GENBANK 
BTHKURHD, accession M15271, L thuringiensis.var . kurstaki , 
HD-l; Geiser et ml. Gene 48:109-118 (1986)) as derived directly 
from Bacillus f.hurinaiensis , might contain sites which prevent 
this gene from being properly processed in plants. 

Further difficulties exist when attempting to express 
Bacillus thurinai^nsis protein in an organism such as a plant . 
It has been discovered that the codon usage of a native Bt IP 
gene is significantly different from that which is typical of a 
plant gene. In particular, the codon usage of a native Bt IP 
gene is very different from that of a maize gene. As a result, 
the mRNA from this gene may not be efficiently utilized. Codon 
usage might influence the expression of genes at the level of 
translation or transcription or mRNA processing. To optimize 
an insecticidal gene for expression in plants, attempts have 
been made to alter the gene to resemble, as much as possible, 
genes naturally contained within the host plant to be 



Adang et ml., EP 0359472 (1990) , relates to a synthetic 
Bacillus thur^nsiensis t-enebrionis (Btt) gene which is 85% 
homologous to the native Btt gene and which is designed to have 
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an A-hT content approximating that found in plants in general. 
Table 1 of Adang et al. show the codon sequence of a synthetic 
Btt gene which was made to resemble more closely the normal 
codon distribution of dicot genes. Adang et al . state that a 
synthetic gene coding for IP can be optimized for enhanced 
expression in monocot plants through similar methods, 
presenting the frequency of codon usage of highly expressed 
monocot proteins in Table 1. At page 9, Adang et al. state 
that the synthetic Btt gene is designed to have an A+T content 
of 55% (and, by implication, a G+C content of 45%) . At page 
20 f Adang et al. disclose that the synthetic gene is designed 
by altering individual amino acid codons in the native Bt gene 
to reflect the overall distribution of codons preferred by 
dicot genes for each amino acid within the coding region of the 
gene. Adang et al. further state that only some of the native 
Btt gene codons will be replaced by the most preferred plant 
codon for each amino acid, such that the overall distribution 
of codons used in dicot proteins is preserved. 

Fischhoff et al., EP 0 385 962 (1990), relates to plant 
genes encoding the crystal protein toxin of Bacillus 
thurinaiensis . At table V, Fischhoff et al. disclose percent 
usages for codons for each amino acid. At page 8, Fischoff et 
al. suggest modifying a native Bt gene by removal of putative 
polyadenylation signals and ATTTA sequences. Fischoff et al. 
further suggest scanning the native Bt gene sequence for 
regions with greater than four consecutive adenine or thymine 
nucleotides to identify putative plant polyadenylation signals. 
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Fischoff et al. state that the nucleotide sequence should be 
altered if more than one putative polyadenylation signal is 
identified within ten nucleotides of each other. At page 9, 
Fischoff et al. state that efforts should be made to select 
codons to preferably adjust the G + C content to about 50%. 

Perlak et al., PNAS USA , 88:3324-3328 (1991), relates 
to modified coding sequences of the Bacillus thurinqiensis 
crylA(b) gene, similar to those shown in Fischoff et al. As 
shown in table 1 at page 3325, the partially modified crylA(b) 
gene of Perlalc et al. is approximately 96% homologous to the 
native crylA(b) gene (1681 of 1743 nucleotides), with a G+C 
content of 41%, number of plant polyadenylation signal 
sequences (PPSS) reduced from 18 to 7 and number of ATTTA 
sequences reduced from 13 to 7 . The fully modified crylA(b) 
gene of Perlalc et al. is disclosed to be fully synthetic (page 
3325, column 1) . This gene is approximately 79% homologous to 
the native crylA(b) gene (1455 of 1845 nucleotides), with a G + C 
content of 49%, number of plant polyadenylation signal 
sequences (PPSS) reduced to 1 and all ATTTA sequences removed. 

Barton et al., EP 0431 829 (199D . relates to the 
expression of insecticidal toxins in plants. At column 10, 
Barton et al. describe the construction of a synthetic AalT 
insect toxin gene encoding a scorpion toxin using the most 
preferred codon for each amino acid according to the chart 
shown in Figure 1 of the document. 

gumma r y of the Invention 
The present invention is drawn to methods for enhancing 
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expression of heterologous genes in plant cells. Generally, a 
gene or coding region of interest is constructed to provide a 
plant specific preferred codon sequence. In this manner, cOdon 
usage for a particular protein is altered to increase 
expression in a particular plant. Such plant optimized coding 
sequences can be operably linked to promoters capable of 
directing expression of the coding seqence in a plant cell. 

Specifically, it is one of the objects of the present 
invention to provide synthetic insecticidal protein genes which 
have been optimized for expression in plants. 

It is another object of the present invention to 
provide synthetic Bt insecticidal protein genes to maximize the 
expression of Bt proteins in a plant, preferably in a maize 
plant. It is one feature of the present invention that a 
synthetic Bt IP gene is constructed using the most preferred 
maize codons, except for alterations necessary to provide 
ligation sites for construction of the full synthetic 



According to the above objects, we have synthesized Bt 
insecticidal crystal protein genes in which the codon usage has 
been altered in order to increase expression in plants, 
particularly maize. However, rather than alter the codon usage 
to resemble a maize gene in terms of overall codon 
distribution, we have optimized the codon usage by using the 
codons which are most preferred in maize (maize preferred 
codons) in the synthesis of the synthetic gene. The optimized 
maize preferred codon usage is effective for expression of high 
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levels of the Bt insect icidal protein. This might be the 
result of maximizing the amount of Bt insecticidal protein 
translated from a given population of messenger RNAs . The 
synthesis of a Bt IP gene using maize preferred codons also 
tends to eliminate fortuitous processing sites that might occur 
in the native coding sequence. The expression of this 
synthetic gene is significantly higher in maize cells than that 

of the native IP Bt gene. 

Preferred synthetic, maize optimized DNA sequences of 
the present invention derive from the protein encoded by the 
crylA(b) gene in Bacillus t huringlensis var. kurstaki, HD-l; 
Geiser et al., Gene, 48:109-118 (1986) or the crylB gene (AKA 
Crya4 gene) described by Brizzard and Whiteley, Nuc. Acids, 
Res . , 16:2723 (1988) . The DNA sequence of the native kurstaki 
HD-l crylA(b) gene is shown as Sequence 1. These proteins are 
active against various lepidopteran insects, including Ostrinia 
nubilalis, the European Corn Borer. 

While the present invention has been exemplified by the 
synthesis of maize optimized Bt protein genes, it is recognized 
that the method can be utilized to optimize expression of any 

protein in plants. 

The instant optimized genes can be fused with a variety 
of promoters, including constitutive, inducible, temporally 
regulated, development ally regulated, tissue-preferred and 
tissue-specific promoters to prepare recombinant DNA molecules, 
i.e., chimeric genes. The maize optimized gene (coding 
sequence) provides substantially higher levels of expression in 
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a transformed plant, when compared with a non-maize optimized 
gene. Accordingly, plants resistant to Coleopteran or 
Lepidopteran pests, such as European corn borer and sugarcane 

borer, can be produced. 

It is another object of the present invention to 
provide tissue-preferred and tissue-specific promoters which 
drive the expression of an operatively associated structural 
gene of interest in a specific part or parts of a plant to the 
substantial exclusion of other parts. 

It is another object of the present invention to 
provide pith-preferred promoters. By "pith-preferred," it is 
intended that the promoter is capable of directing the 
expression of an operatively associated structural gene in 
greater abundance in the pith of a plant than in the roots, 

outer sheath, and brace roots, and with substantially no 

( 

expression in seed. 

It is yet another object of this invention to provide 
pollen-specific promoters. By "pollen-specific," it is 
intended that the promoter is capable of directing the 
expression of an operatively associated structural gene of 
interest substantially exclusively in the pollen of a plant, 
with negligible expression in any other plant part. By 
"negligible," it is meant functionally insignificant. 

It is yet another object of the present invention to 
provide recombinant DNA molecules comprising a tissue-preferred 
promoter or tissue-specific promoter operably associated or 
linked to a structural gene of interest, particularly a 
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structural gene encoding an insecticidal protein, and 
expression of the recombinant molecule in a plant. 

It is a further object of the present invention to 
provide transgenic plants which express at least one structural 
gene of interest operatively in a tissue-preferred or 
tissue-specific expression pattern. 

in one specific embodiment of the invention disclosed 
and claimed herein, the tissue-preferred or tissue-specific 
promoter is operably linked to a structural gene encoding an 
insecticidal protein, and a plant is stably transformed with at 
least one such recombinant molecule. The resultant plant will 

_ ^ ,- ( „,i ar insects which feed on those parts of 
be resxstant to particuxar 

the plant in which the gene(s) is (are) expressed. Preferred 
structural genes encode B.t. insecticidal proteins. More 
preferred are maize optimized B.t. IP genes. 

Brief Desr-T-iption o f t-h« Figures 
Fig. 1 is a comparison of the full-length native Bt 
crylA(b) gene [BTHKORHD] , a full-length synthetic maize 
optimized Bt crylA(b) gene [flsynbt.fin] and a truncated 
synthetic maize optimized Bt crylA(b) gene [bssyn] . This 
figure shows that the full-length synthetic maize optimized 
crylA(b) gene sequence matches that of the native crylA(b) gene 
at about 2354 out of 3468 nucleotides (approximately 68% 
homology) . 

Fig. 2 is a comparison of the truncated native Bt 
crylA(b) gene [BTHKURHD] and a truncated synthetic maize 

8 
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optimized Bt gene [bssyn] . This figure shows that the 
truncated synthetic maize optimized crylA(b) gene sequence 
matches that of the native crylA(b) gene at about 1278 out of 
1947 nucleotides (approximately 66% homology) . 

Fig. 3 is a comparison of the pure maize optimized Bt 
gene sequence [synlT.mze] with a truncated synthetic maize 
optimized Bt gene [bssyn] and a full-length synthetic maize 
optimized Bt gene modified to include restriction sites for 
facilitating construction of the gene [synful .mod] . This 
figure shows that the truncated synthetic maize optimized 
crylA(b) gene sequence matches that of the pure maize optimized 
crylA(b) gene at 1913 out of 1947 nucleotides (approximately 

98% homology) - 

Fig. 4 is a comparison of a native truncated Bt 
crylA(b) gene [BTHKURHD] with a truncated synthetic crylA(b) 
gene described in Perlak et al., PNAS USA , 88:3324-3328 (1991) 
[PMONBT] and a truncated synthetic maize optimized Bt gene 
[bssyn] . This figure shows that the PMONBT gene sequence 
matches that of the native crylA(b) gene at about 1453 out of 
1845 nucleotides (approximately 79% homology), while the 
truncated synthetic maize optimized Bt crylA(b) gene matches 
the native crylA(b) gene at about 1209 out of 1845 nucleotides 
(approximately 66% homology) . 

Fig. 5 is a comparison of a truncated synthetic 
crylA(b) gene described in Perlak et al., PNAS PSA, 
88:3324-3328 (1991) [PMONBT 1 and a truncated synthetic maize 
optimized Bt crylA(b) gene [bssyn]. This figure shows that the 



WO 93/07278 



PCT/US92/08476 



PMONBT gene sequence matches that of the truncated synthetic 
maize optimized Bt crylA(b) gene at about 1410 out of 1845 
nucleotides (approximately 77% homology) . 

Fig. 6 is a full-length, maize optimized Cry IB gene. 
Fig. 7 is a full-length, hybrid, partially maize 
optimized DNA sequence of a CrylA(b) gene which is contained in 
PCIB4434. The synthetic region is from nucleotides 1-1938 
(amino acids 1-646) , and the native region is from nucleotides 
1939-3468 (amino acids 647-1155) . The fusion point between the 
synthetic and native coding sequences is indicated by a slash 

(/) in the sequence. 

Fig. 8 is a map of pCIB4434. 

Fig. 9 is a full-length, hybrid, maize optimized DNA 
sequence encoding a heat stable CrylA(b) protein, contained in 

PCIB5511. 

Fig. 10 is a map of pCIB5511. 

Fig. 11 is a full-length, hybrid, maize optimized DNA 
sequence encoding a heat stable CrylA(b) protein, contained in 

pCIB5512. 

Fig. 12 is a map of pCIB5512. 

Fig. 13 is a full-length, maize optimized DNA sequence 
encoding a heat stable CrylA(b) protein, contained in P CIB5513 
Fig. 14 is a map of pCIB5513. 

Fig. 15 is a full-length, maize optimized DNA sequence 
encoding a heat-stable CrylA(b) gene, contained in pCIB5514. 
Fig. 16 is a map of pCIB5514 . 
Fig. 17 is a map of pCIB4418. 

10 
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Fig. 18 is a map of pCIB4420. 

Fig. 19 is a map of pCIB4429. 

Fig- 20 is a map of pCIB4431. 

Fig. 21 is a map of pCIB4428. 

Fig. 22 is a map of pCIB4430. 

Fig. 23A is a table containing data of crylA(b) protein 

levels in transgenic maize. 

Fig. 23B is a table which summarizes results of 
bioassays of Ostrinia and Diatraea on leaf material from maize 
progeny containing a maize optimized CrylA(b) gene. 

Fig. 23C is a table containing data of crylA(b) protein 

levels in transgenic maize. 

Fig. 23D is a table which summarizes the results of 
bioassays of Ostrinia and Diatraea on leaf material from maize 
progeny containing a synthetic Bt. maize gene operably linked 

to a pith promoter. 

Fig. 23E is a table containing data on expression of 
the crylA(b) gene in transgenic maize using the pith-preferred 
promoter. 

Fig. 24 is a complete genomic DNA sequence of a maize 
tryptophan synthase- alpha subunit gene. Introns, exons, 
transcription and translation starts, start and stop of cDNA 
are shown. $ = start and end of cDNA; +1 - transcription 
start; 73******* » primer extension primer; +1 = start of 
translation; +++ = stop codon; bp 1495-99 « CCAAT Box; bp 
1593-1598 - TATAA Box; bp 3720-3725 = poly A addition site; 
# above underlined sequences are PCR primers. 

11 
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Figs. 25A, 25B, 25C and 2SD are Northern blot analyses 
which show differential expression of the maize TrpA subunit 
gene in maize tissue at 2 hour, 4 hour, 18 hour, and 48 hour 
intervals, respectively, at -80-C with DuPont Cronex 
intensifying screens. P=pith; C=cob; BR-brace roots; ES=ear 
shank; LP-lower pith; MP=middle pith; OP=upper pith; S=seed; 
L=leaf; R=root; and P (upper left)=total pith. 

Fig. 26 is a Northern blot analysis, the two left lanes 
of which show the maize TrpA gene expression in the leaf (L) 
and pith (P) of Funk inbred lines 211D and 5N984. The five 
right lanes indicate the absence of expression in Funk 211D 
seed total RNA. S(l, 2,3, 4 and 5)= seed at 1, 2, 3, 4 and 5 
weeks post pollenation. L-leaf; P=pith; S#=seed # weeks post 
pollenation . 

Fig. 27 is a Southern blot analysis of genomic DNA Funk 
line 211D, probed with maize TrpA cDNA 8-2 ( P CIBS600) , wherein 
B denotes BamHI, E denotes EcoRI, EV denotes EcoRV, H denotes 
HINDIII, and S denotes Sad. IX, 5X and 10X denote 
reconstructed gene copy equivalents. 

Fig. 2BA is a primer extension analysis which shows the 
transcriptional start of the maize TrpA subunit gene and 
sequencing ladder. Lane +1 and +2 are IX + O.SX samples of 

primer extension reaction. 

Fig. 28B is an analysis of RNase protection from +2 bp 
to +387 bp at annealing temperatures of 42-C, 48«C and 54-C, at 
a 16 hour exposure against film at -80-C with DuPont Cronex 
intensifying screens. 

12 
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Fig. 29 is A map of the original Type II 
pollen-specific cDNA clone. The subcloning of the three EcoRI 
fragments into pBluescript vectors to create pCIB3168, pCIB3169 
and II-. 6 is illustrated. 

Fig. 30 shows the DNA sequence of the maize 
pollen-specific calcium dependent protein kinase gene cDNA, as 
contained in the 1.0 kb and 0.5 kb fragments of the original 
Type II cDNA clone. The EcoRI site that divides the 1.0 kb and 
0.5 kb fragments is indicated. This cDNA is not full length, 
as the mRNA start site maps 4 90 bp upstream of the end of the 
cDNA clone. 

Fig. 31 illustrates the tissue-specific expression of 
the pollen CDPK mRNA. RNA from the indicated maize 211D 
tissues was denatured/ electrophoresed on an agarose gel, 
transferred to nitrocellulose, and probed with the pollen CDPK 
cDNA 0.5 kb fragment. The mRNA is detectable only in the 
pollen, where a strong signal is seen. 

Fig. 32 is an amino acid sequence comparison of the 
pollen CDPK derived protein sequence and the rat protein kinase 
2 protein sequence disclosed in Tobimatsu et al., J. Biol. 
Chem. 263:16082-16086 (1988). The Align program of the DNAstar 
software package was used to evaluate the sequences. The 
homology to protein kinases occurs in the 5' two thirds of the 
gene, i.e. in the 1.0 kb fragment. 

Fig. 33 is an amino acid sequence comparison of the 
pollen CDPK derived protein sequence and the human calmodulin 
protein sequence disclosed in Fischer et al., J. Biol. Chem. 

13 
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263:17055-17062 (1988). The homology to calmodulin occurs in 
the 3' one third of the gene, i.e. in the 0.5 to fragment. 

Fig. 34 is an amino acid sequence comparison of the 
pollen CDPK derived protein sequence and soybean CDPK. The 
homology occurs over the entire gene. 

Fig. 35 illustrates the sequence of the maize 
pollen-specific CDPK gene. 1.4 to of sequence prior to the 
mRNA start site is shown. The positions of the seven exons and 
six introns are depicted under the corresponding DNA sequence. 
The site of polyadenylation in the cDNA clone is indicated. 

Fig. 36 is a map of pCIB4433. 

Fig. 37 is a full-length, hybrid, maize-optimized DNA 
sequence encoding a heat stable crylA(b) protein. 
Fig. 38 is a map of pCIB5515 . 

Description of th «* Sequences 
Sequence 1 is the DNA sequence of a full-length native 

Bt crylA(b) gene, 

sequence 2 is the DNA sequence of a full-length pure 

maize optimized synthetic Bt crylA(b) gene. 

Sequence 3 is the DNA sequence of an approximately 2 Kb 
truncated synthetic maize optimized Bt crylA(b) gene. 

Sequence 4 is the DNA sequence of a full-length 
synthetic maize optimized Bt crylA(b) gene. 

Sequence 5 is the DNA sequence of an approximately 2 Kb 
synthetic Bt gene according to Perlak et al. 

nailed Descrip tion of th»- Invention 

The following definitions are provided in order to 

14 
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provide clarity with respect to the terms as they are used in 
the specification and claims to describe the present invention. 

Maize preferred codon: Preferred codon refers to the 
preference exhibited by a specific host cell in the usage of 
nucleotide codons to specify a given amino acid. The preferred 
codon for an amino acid for a particular host is the single 
codon which most frequently encodes that amino acid in that 
host. The maize preferred codon for a particular amino acid 
may be derived from known gene sequences from maize. For 
example, maize codon usage for 2B genes from maize plants are 
listed in Table 4 of Murray et al., Nucleic A cids Research, 
17:477-498 (1989) , the disclosure of which is incorporated 
herein by reference. For instance, the maize preferred codon 
for alanine is GCC, since, according to pooled sequences of 2 6 
maize genes in Murray et al., supra , that codon encodes alanine 

36% of the time, compared tcf GCG (24%), GCA (13%), and GCT 

(27%) . 

Pure maize optimized sequence: An optimized gene or DNA 
sequence refers to a gene in which the nucleotide sequence of a 
native gene has been modified in order to utilize preferred 
codons for maize. For example, a synthetic maize optimized Bt 
crylA(b) gene is one wherein the nucleotide sequence of the 
native Bt crylA(b) gene has been modified such that the codons 
used are the maize preferred codons, as described above. A 
pure maize optimized gene is one in which the nucleotide 
sequence comprises 100 percent of the maize preferred codon 
sequences for a particular polypeptide. For example, the pure 

15 
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maize optimized Bt crylA(b) gene is one in which the nucleotide 
sequence comprises 100 percent maize preferred codon sequences 
and encodes a polypeptide with the same amino acid sequence as 
that produced by the native Bt crylA(b) gene. The pure 
nucleotide sequence of the optimized gene may be varied to 
permit manipulation of the gene, such as by altering a 
nucleotide to create or eliminate restriction sites. The pure 
nucleotide sequence of the optimized gene may also be varied to 
eliminate potentially deleterious processing sites, such as 
potential polyadenylation sites or intron recognition sites. 

It is recognized that "partially maize optimized, " 
sequences may also be utilized. By partially maize optimized, 
it is meant that the coding region of the gene is a chimeric 
(hybrid) , being comprised of sequences derived from a native 
insecticidal gene and sequences which have been optimized for 
expression in maize. A partially optimized gene expresses the 
insecticidal protein at a level sufficient to control insect 
pests, and such expression is at a higher level than achieved 
using native sequences only. Partially maize o] 
sequences include those which contain at least about 5% 

optimized sequences. 

Full-length Bt Genes: Refers to DNA sequences 
comprising the full nucleotide sequence necessary to encode the 
polypeptide produced by a native Bt gene. For example, the 
native Bt crylA(b) gene is approximately 3.5 Kb in length and 
encodes a polypeptide which is approximately 1150 amino acids 
in length. A full-length synthetic crylA(b) Bt gene would be 
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at least approximately 3.5 Kb in length. 

Truncated Bt Genes; Refers to DNA sequences comprising 
less than the full nucleotide sequence necessary to encode the 
polypeptide produced by a native Bt gene, but which encodes the 
active toxin portion of the polypeptide. For example, a 
truncated synthetic Bt gene of approximately 1 . 9 Kb encodes the 
active toxin portion of the polypeptide such that the protein 
product exhibits insecticidal activity. 

Tissue-preferred promoter: The term "tissue- 
preferred promoter" is used to indicate that a given regulatory 
DNA sequence will promote a higher level of transcription of an 
associated structural gene or DNA coding sequence, or of 
expression of the product of the associated gene as indicated 
by any conventional RNA or protein assay, or that a given DNA 
sequence will demonstrate some differential effect; i.e., that 
the transcription of the associated DNA sequences or the 
expression of a gene product is greater in some tissue than in 
all other tissues of the plant . 

"Tissue-specific promoter" is used to indicate that a 
given regulatory DNA sequence will promote transcription of an 
associated coding DNA sequence essentially entirely in one or 
more tissues of a plant, or in one type of tissue, e.g. green 
tissue, while essentially no transcription of that associated 
coding DNA sequence will occur in all other tissues or types of 
tissues of the plant. 

The present invention provides DNA sequences optimized 
for expression in plants, especially in maize plants. In a 
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preferred embodiment of the present invention, the DNA 
sequences encode the production of an insecticidal toxin, 
preferably a polypeptide sharing substantially the amino acid 
sequence of an insecticidal crystal protein toxin normally 
produced by Bacillus i-^rinaiensis . The synthetic gene may 
encode a truncated or full-length insecticidal protein. 
Especially preferred are synthetic DNA sequences which encode a 
polypeptide effective against insects of the order Lepidoptera 
and ccleoptera , and synthetic DNA sequences which encode a 
polypeptide having an amino acid sequence essentially the same 
as one of the crystal protein toxins of Bacillus thuringiensi s 

variety fcurstaki, HD-1 . 

The present invention provides synthetic DNA sequences 

effective to yield high expression of active insecticidal 
proteins in plants, preferably maize protoplasts, plant cells 
and plants. The synthetic DNA sequences of the present 
invention have been modified to resemble a maize gene in terms 
of codon usage and G+C content. As a result of these 
modifications, the synthetic DNA sequences of the present 
invention do not contain the potential processing sites which 
are present in the native gene. The resulting synthetic DNA 
sequences (synthetic Bt IP coding sequences) and plant 
transformation vectors containing this synthetic DNA sequence 
(synthetic Bt IP genes) result in surprisingly increased 
expression of the synthetic Bt IP gene, compared to the native 
Bt IP gene, in terms of insecticidal protein production in 
plants, particularly maize. The high level of expression 
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results in maize cells and plants that exhibit resistance to 
lepidopteran insects, preferably European Corn Borer and 
Diatrea saccharalis , the Sugarcane Borer. 

The synthetic DNA sequences of the present invention 
are designed to encode insecticidal proteins from Bacillus 
thurinqiensis , but are optimized for expression in maize in 
terms of G+C content and codon usage. For example, the maize 
codon usage table described in Murray et al., supra , is used to 
reverse translate the amino acid sequence of the toxin produced 
by the Bacillus thurinqiensis subsp. kurstaki HD-1 crylA(b) 
gene, using only the most preferred maize codons . The reverse 
translated DNA sequence is referred to as the pure maize 
optimized sequence and is shown as Sequence 4. This sequence 
is subsequently modified to eliminate unwanted restriction 
endonuclease sites, and to create desired restriction 
endonuclease sites. These modifications are designed to 
facilitate cloning of the gene without appreciably altering the 
codon usage or the maize optimized sequence. During the 
cloning procedure, in order to facilitate cloning of the gene, 
other modifications are made in a region that appears 
especially susceptible to errors induced during cloning by the 
polymerase chain reaction (PCR) . The final sequence of the 
maize optimized synthetic Bt IP gene is shown in Sequence 2. A 
comparision of the maize optimized synthetic Bt IP gene with 
the native kurstaki crylA(b) Bt gene is shown in Fig. 1. 

In a preferred embodiment of the present invention, the 
protein produced by the synthetic DNA sequence is effective 
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In a 



.gains, insects of U. order iezidoctera « ColeoESera. 
more preferred embodiment, the polypeptide encoded by the 
synthetic DNA sequence consists essentiaily of the full-length 
„ a truncated amino acid sequence of an insecticidal protein 
normally produced by Bacillus thnringiensis var. jeursta^ HD-1 . 
in a particular embodiment, the synthetic DBA sequence encodes 
a polypeptide consisting essentially of a truncated amino acid 
sequence of the Bt crylA(b) protein. 

The insecticidal proteins of the invention are 
expressed in a plant in an amount sufficient to control insect 
pests, i.e. insect controlling amounts. It is recognised that 
the amount of expression of insecticidal protein in a plant 
necessary to control insects may vary depending upon species of 
plant, type of insect, environmental factors and the like. 
Generally, the insect population -ill be kept below the 
economic threshold which varies from plant to plant. For 
example, to control European corn borer in maize, the economic 
threshold is .5 eggmass/plant which translates to about 10 
larvae/plant . 

The methods of the invention are useful for controlling 
a wide variety of insects including but not limited to 
rootworms, cutworms, armyworms, particularly fall and beet 
armyworms, wireworms, aphids, corn borers, particularly 

k««t- lesser corn stalk borer, 
European corn borers, sugarcane borer, lesser 

Southwestern corn borer, etc. 

in a preferred embodiment of the present invention, the 
synthetic coding DNA sequence optimized for expression in 
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maize comprises a G+C percentage greater than that of the 
native crylA(b) gene. It is preferred that the G+C percentage 
be at least about 50 percent, and more preferably at least 
about 60 percent. It is especially preferred that the G+C 
percent be about 64 percent. 

In another preferred embodiment of the present 
invention, the synthetic coding DNA sequence optimized for 
expression in maize comprises a nucleotide sequence having at 
least about 90 percent homology with the "pure" maize optimized 
nucleotide sequence of the native Bacillus thurinqiensis 
crylA(b) protein, more preferably at least about 95 percent 
homology, and most preferably at least about 98 percent. 

Other preferred embodiments of the present invention 
include synthetic DNA sequences having essentially the DNA 
sequence of Sequence ID No . 4, as well as mutants or variants 
thereof; transformation vectors comprising essentially the DNA 

■ 

sequence of Sequence ID No. 4; and isolated DNA sequences 
derived from the plasmids pCIB4406, pCIB4407, pCIB4413, 
pCIB4414, pCIB4416, pCIB4417, pCIB4418, pCIB4419, pCIB4420, 
PCIB4421, PCIB4423, pCIB4434, pCIB4429, pCIB4431, P CIB4433. 
Most preferred are isolated DNA sequences derived from the 
plasmids pCIB4418 and pCIB4420, pCIB4434, pCIB4429, pCIB4431, 
and pCIB4433. 

in order to construct one of the maize optimized DNA 
sequences of the present invention, synthetic DNA 
oligonucleotides are made with an average length of about 80 
nucleotides. These oligonucleotides are designed to hybridize 
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to Produce fragments comprising the various quarters of the 
truncated toxin gene. The oligonucleotides for a given quarter 
are hybridized and amplified using PGR. The quarters are then 
cloned and the cloned quarters are sequenced to find those 
containing the desired sequences. In one instance, the fourth 
quarter, the hybridized oligonucleotides are cloned directly 
without PGR amplification. Once all clones of four quarters 
are found which contain open reading frames, an intact gene 
encoding the active insecticidal protein is assembled. The 
assembled gene may then be tested for insecticidal activity 
against any insect of interest including the European Corn 
Borer (ECB) and the sugarcane borer. (Examples 5A and 5B, 
respectively) . When a fully functional gene is obtained, it is 
again sequenced to confirm its primary structure. The fully 
functional gene is found to give 100% mortality when bioassayed 
against ECB. The fully functional gene is also modified for 

expression in maize. 

The maize optimized gene is tested in a transient 
expression assay, e.g. a maize transient expression assay. 
The native Bt crylA(b) coding sequence for the active 
insecticidal toxin is not expressed at a detectable level in a 
maize transient expression system. Thus, the level of 
expression of the synthesized gene can be determined. By the 
present methods, expression of a protein in a transformed plant 
can be increased at least about 100 fold to about 50,000 fold, 
more specifically at least about 1,000 fold to at least about 
20,000 fold- 
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Increasing expression of an insecticial gene to an 
effective level does not require manipulation of a native gene 
along the entire sequence. Effective expression can be 
achieved by manipulating only a portion of the sequences 
necessary to obtain increased expression. A full-length, maize 
optimized CrylA(b) gene may be prepared which contains a 
protein of the native CrylA(b) sequence. For example, Figure 7 
illustrates a full-length, maize optimized CrylA(b) gene which 
is a synthetic-native hybrid. That is, about 2kb of the gene 
(nucleotides 1-1938) is maize optimized, i.e. synthetic. The 
remainder, C-terminal nucleotides 647-1155, are identical to 
the corresponding sequence native of the CrylA(b) gene. 
Construction of the illustrated gene is described in Example 6, 
below. 

It is recognized that by using the methods described 
herein, a variety of synthetic/native hybrids may be 
constructed and tested for expression. The important aspect of 
hybrid construction is that the protein is produced in 
sufficient amounts to control insect pests. In this manner, 
critical regions of the gene may be identified and such regions 
synthesized using preferred codons. The synthetic sequences 
can be linked with native sequences as demonstrated in the 
Examples below. Generally, N-terminal portions or processing 
sites can be synthesized and substituted in the native coding 
sequence for enhanced expression in plants. 

In another embodiment of the present invention, the 
maize optimized genes encoding crylA(b) protein may be 
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manipulated to render the encode. protein more heat stable or 
temperature stable compared to the native cryIA<b, P™. 
be. been shown that the crylA (b) gene found in Bacillus 
..„ r<n oiensis xurstafci HD-1 contains a 26 amino acid deletion, 
.hen compared with the crylA<a> and =ryIA<c> proteins, a. the 
-COOB half of the protein . This deletion leads to a 
temperature-sensitive crylAlb, protein, see M. Geiser, EP 0 
440 581, entitled -Temperaturstabiles Bacillus 
£SHsisai ensis-Torin» . Repair of this deletion with the 
corresponding region from the cryIA<a, or crylA(c) protein 
proves the temperature stability of the repaired protein, 
constructs of the full-length modified crylA(b) synthetic gene 
are designed to insert sequences coding for the missing amino 
acids at the appropriate place in the seguenc. without altering 
the reading frame and without changing the rest of the protein 
sequence. The full-length synthetic version of the gene is 
assembled by synthesizing a series of double-stranded DBA 
cassettes, each approximately 300 bp in size, using standard 
techniques of DMA synthesis and enzymatic reactions . The 
repaired gene is said to encode a -heat stable- or 
-temperature-stable- cryJA.b, protein, since it retain, more 
biological activity than its native counterpart when exposed to 
high temperatures . Specific sequences of maize optimized, beat 
stable crylA(b) genes encoding temperature stable proteins are 
set forth in Pigs. 9, 11, 13, and IS, and are also described in 

Example 7, below. 

The present invention encompasses maize optimized 
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coding sequences encoding other polypeptides, including those 
of other Bacillus thurinoiensis insecticidal polypeptides or 
insecticidal proteins from other sources. For example, cry IB 
genes can be maize optimized, and then stably introduced into 
plants, particularly maize. The sequence of a maize optimized 
crylB gene constructed in accordance with the present invention 
is set forth in Fig. 6. 

Optimizing a Bt IP gene for expression in maize using 
the maize preferred codon usage according to the present 
invention results in a significant increase in the expression 
of the insecticidal gene. It is anticipated that other genes 
can be synthesized using plant codon preferences to improve 
their expression in maize or other plants. Use of maize codon 
preference is a likely method of optimizing and maximizing 
expression of foreign genes in maize. Such genes include genes 
used as selectable or scoreable markers in maize 
transformation, genes which confer herbicide resistance, genes 
which confer disease resistance, and other genes which confer 

insect resistance. 

The synthetic crylA(b) gene is also inserted into 
Aarobacterium vectors which are useful for transformation of a 
large variety of dicotyledenous plant species. (Example 44) . 
Plants stably transformed with the synthetic crylA(b) 
Aarobacterium vectors exhibit insecticidal activity. 

The native Bt crylA(b) gene is quite A+T rich. The G+C 
content of the full-length native Bt crylA(b) gene is 
approximately 39%. The G+C content of a truncated native Bt 
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=ryl*<b> gene of about 2 Kb in length i. approximately *». I- 
ge „er*l. maize coding regions tend to be predominantly G+ C 
rich . The Edifications made to the Bt cryIAO» gene result m 
a synthetic IE coding region »hich has greater than 50% 0 + C 
content, end has about 65% homology at the OHA level with the 
native cryUMb, gene. The protein encoded by this synthetic 
CryI*Cb, gene is 100% homologous with the native protein, and 
thus retains full function in terms of insect activity . The 
truncated synthetic CrylA.b, I> gene is about 2 Kb in length 
and the gene encodes the active toxin region of the native Bt 
SHS£SS i CryiXtb) insecticidal protein. The length of the 
^Tencoded by the truncated synthetic Cry IA (b, gene is 

amino acids. 

-p nresent invention are useful 

The synthetic genes of the presenr xn 

tot enhanced expression in transgenic Plants, most preferably 
in transformed maize. The transgenic plants of the present 
invention may be used to express the insecticidal CryTMb, 
protein at a high level, resulting in resistance to insect 
pests, preferably coleopteran or lepidopteran insects, and most 
preferably European Corn Borer (ECB) and sugarcane Borer. 

m the present invention, the DHA coding sequence of 
the synthetic maize optimised gen. may be under the control of 
regulatory elements such a. promoters which direct expression 

.„ rh reoulatory elements, for example, 
of the coding sequence. Such regulatory 

include monocct or maize and otber monococ functional promoters 
to provide expression of the gene in various parts of the marze 
piant. The regulatory element may be constitutive. That is, 
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it may promote continuous and stable expression of the gene. 
Such promoters include but are not limited to the CaMV 35S 
promoter; the CaMV 19S promoter; tumefaciens promoters such 
as octopine synthase promoters, mannopine synthase promoters, 
nopaline synthase promoters, or other opine synthase promoters; 
ubiquitin promoters, actin promoters, histone promoters and 
tubulin promoters. The regulatory element may be a 
tissue-preferential promoter, that is, it may promote higher 
expression in some tissues of a plant than in others. 
Preferably, the tissue-preferential promoter may direct higher 
expression of the synthetic gene in leaves, stems, roots and/ or 
pollen than in seed. The regulatory element may also be 
inducible, such as by heat stress, water stress, insect feeding 
or chemical induction, or may be developmentally regulated. 
Numerous promoters whose expression are known to vary in a 
tissue specific manner are known in the art. One such example 
is the maize phosphoenol pyruvate carboxylase (PEPC) , which is 
green tissue-specific. See , for example, Hudspeth, R.L. and 
Grula, J.W., Plant Molecular Biology 12:579-589, 1989). Other 
green tissue-specific promoters include chlorophyll a/b binding 
protein promoters and RubisCO small subunit promoters. 

The present invention also provides isolated and 
purified pith-preferred promoters. Preferred pith-preferred 
promoters are isolated from graminaceous monocots such as 
sugarcane, rice, wheat, sorghum, barley, rye and maize; more 
preferred are those isolated from maize plants. 

In a preferred embodiment, the pith-preferred promoter 
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is isolated from a plant TrpA gene; in a most preferred 
embodiment, it is isolated from a maize TrpA gene. That is, 
the promoter In its native state is operatively associated with 
a maize tryptophan synthase-alpha subunit gene (hereinafter 
"TrpA") . The encoded protein has a molecular mass of about 
38kD. Together with another alpha subnit and two beta 
subunits, TrpA forms a multimeric enzyme, tryptophan synthase. 
Each subunit can operate separately, but they function more 
efficiently together. TrpA catalyzes the conversion of indole 
glycerol phosphate to indole. Neither the maize TrpA gene nor 
the encoded protein had been isolated from any plant before 
Applicants' invention. The Arabidopsis thaliana tryptophan 
synthase beta subunit gene has been cloned as described Wright 
et al., th- Plant Cell , 4:711-719 (1992). The instant maxze 
TrpA gene has no homology to the beta subunit encoding gene. 

The present invention also provides purified 
pollen-specific promoters obtainable from a plant 
calcium-dependent phosphate kinase (CDEK) gene. That is, in 
its native state, the promoter is operably linked to a plant 
CDPK gene. In a preferred embodiment, the promoter is isolated 
fr om a maize CDPK gene. By "pollen-specific," it is meant that 
the expression of an operatively associated structural gene of 
interest is substantially exclusively (i.e. essentially 
entirely) in the pollen of a plant, and is negligible in all 
other plant parts. By "CDPK, " it is meant a plant protein 
kinase which has a high affinity for calcium, but not 
calmodulin, and requires calcium, hut not calmodulin, for its 
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catalytic activity. 

To obtain tissue-preferred or tissue specific 
promoters, genes encoding tissue specific messenger RNA (mRNA) 
can be obtained by differential screening of a cDNA library. 
For example, a pith-preferred cDNA can be obtained by 
subjecting a pith cONA library to differential screening using 
cDNA probes obtained from pith and seed mRNA. See, Molecular 
Cloning, A Laboratory Manual, Sambrook et al. eds . Cold Spring 

Harbor Press: New York (198 9) . 

Alternately, tissue specific promoters may be obtained 
by obtaining tissue specific proteins, sequencing the 
N-terminus, synthesizing oligonucleotide probes and using the 
probes to screen a cDNA library. Such procedures are 
exemplified in the Experimental section for the isolation of a 

pollen specific promoter. 

The scope of the present invention in regard to the 
pith-preferred and pollen-specific promoters encompasses 
functionally active fragments of a full-length promoter that 
also are able to direct pith-preferred or pollen-; 
transcription, respectively, of associated structural genes. 
Functionally active fragments of a promoter DNA sequence may be 
derived from a promoter DNA sequence, by several art-recognized 
procedures, such as, for example, by cleaving the promoter DNA 
sequence using restriction enzymes, synthesizing in accordance 
with the sequence of the promoter DNA sequence, or may be 
obtained through the use of PCR technology. See, e.g. Mullis 
et al., Meth. Enzvmol. 155:335-350 (1987); Erlich (ed.), PCR 
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2 echnol 22Z , Stockton Press (New York 1989) . 

Further included within the scope of the instant 
invention are pith-preferred and pollen-specific promoters 
-equivalent- to the full-length promoters. That is, different 
nucleotides, or groups of nucleotides may be modified, added or 
deleted in a manner that does not abolish promoter activity in 

accordance with known procedures. 

. i n*»ti from a maize TrpA 

A pith-preferred promoter obtained rrom 

« Pio 24 Those skilled in the art, with this 

gene is shown in Fig. 

sequence information in hand, will recognize than 
pith-preferred promoters included within the scope of the 
present invention can be obtained from other plants by probing 
pith libraries from these plants -ith probes derived from the 
maize TrpA structural gene. Probes designed from sequences 
that are highly conserved among TrpA eubunit genes of various 
species, as discussed generally in Example 17, are preferred, 
other pollen-epecific promoters, «hi=h in their netive state 
are linked' to plant CDPK genes other than maize, can be 
isolated in similar fashion using probes derived from the 
conserved regions of the maize CDPK gene to probe pollen 
libraries . 

In another embodiment of the present invention, the 
pith-preferred or pollen-specific promoter is operebly linked 
t0 a ON* sequence, i.e. structurel gene, encoding a protein of 
interest to form a recombinant DNA molecule or chimeric gene . 
The phrase "operafcly linked to" has an art-recognized meaning; 
it may be used interchangeably with -operatively associated 
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with, ""linked to," or "fused to". 

The structural gene may be homologous or heterologous 
with respect to origin of the promoter and/or a target plant 
into which it is transformed. Regardless of relative origin, 
the associated DNA sequence will be expressed in the 
transformed plant in accordance with the expression properties 
of the promoter to which it is linked. Thus, the choice of 
associated DNA sequence should flow from a desire to have the 
sequence expressed in this fashion. Examples of heterologous 
DNA sequences include those which encode insect icidal proteins, 
e.g. proteins or polypeptides toxic or inhibitory to insects or 
other plant parasitic arthropods, or plant pathogens such as 
fungi, bacteria and nematodes. These heterologous DNA 
sequences encode proteins such as magainins, Zasloff, PNAS USA, 
84:5449-5453 (1987); cecropins, Hultmark et al., Eur. J. 
Biochem. 127:207-217 (1982); attacins, Hultmark et al., EMBO J, 
2:571-576 (1983); melittin, gramicidin S, Katsu et al., 
Biochem. Biophvs. Acta , 939:57-63 (1988); sodium channel 
proteins and synthetic fragments, Oiki et al. PNAS USA, 
85:2395-2397 (1988); the alpha toxin of staphylococcus aureusm 
Tobkes et al., Biochem, , 24:1915-1920 (1985); apolipoproteins 
and fragments thereof, Knott et al-, Science 230:37 (1985); 
Nakagawa et al., J. Am. Chem. Soc 107:7087 (1985); 
alamethicin and a variety of synthetic amphipathic peptides, 
Kaiser et al., Ann. Rev. Biophvs . Biophvs: Chenu 16:561-581 
(1987); lectins, Lis et al., Ann. Rev - Biochem., 55:35-68 
(1986); protease and amylase inhibitors; and insecticidal 
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proteins fro» Bacillus L»"in<,iensis,, Particularly chs 
oalta-endotoxin, fro. fc. taurinaiensis; snd fro. orhar oaoraria 

or fungi. 

in a preferred embodiment of the invention, a 
pith-preferred promoter obtained from a maize TrpA subunit gene 
or pollen-specific promoter obtained from a maize CDPK gene is 
operably linked to a heterologous DNA sequence encoding a 
Bacillus ^^-tnaiensis («B.t.»> insecticidal protein. These 
proteins and the corresponding structural genes are well known 
in the art. See, Hofte and Whiteley, Mlrrohiol. Reviews , 

53:242-255 (1989) . 

While it is recognized that any promoter capable of 

directing expression can be utilized, it may be preferable to 
use heterologous promoters rather than the native promoter of 
the protein of interest. In this manner, chimeric nucleotide 
seq uences can be constructed which can be determined based on 
the plant to be transformed as well as the insect pest. For 
example, to control insect pests in maize, a monocot or maxze 
promoter can be operably linked to a Bt protein. The maize 
promoter can be selected from tissue-preferred and tissue- 
specific promoters such as pith-preferred and pollen-: ^ 
promoters, respectively as disclosed herein. 

in some instances, it may be preferred to transform the 
plant cell with more than one chimeric gene construct. Thus, 
for example, a single plant could be transformed with a 
pith-preferred promoter operably linked to a Bt protein as well 
as a pollen-specific promoter operably linked to a Bt protein. 
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The transformed plants would express Bt proteins in the plant 
pith and pollen and to a lesser extent the roots, outer sheath 

and brace roots . 

For various other reasons, particularly management of 
potential insect resistance developing to plant expressed 
insecticidal proteins, it is beneficial to express more than 
one insecticidal protein (IP) in the same plant. One could 
express two different genes (such as two different Bacillus 
thurinqiensis derived delta-endotoxins which bind different 
receptors in the target insect's midgut) in the same tissues, 
or one can selectively express the two toxins in different 
tissues of the same plant using tissue specific promoters. 
Expressing two Bt genes (or any two insecticidal genes) in the 
same plant using three different tissue specific promoters 
presents a problem for production of a plant expressing the 
desired phenotype. Three different promoters driving two 
different genes yields six different insecticidal genes that 
need to be introduced into the plant at the same time. Also 
needed for the transformation is a selectable marker to aid in 
identification of transformed plants. This means introducing 
seven different genes into the plant at the same time. It is 
most desired that all genes, especially the insecticidal genes, 
integrate into the plant genome at the same locus so they will 
behave as a single gene trait and not as a multiple gene trait 
that will be harder to track during breeding of commercial 
hybrids. The total number of genes can be reduced by using 
differential tissue specific expression of the different 
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insecticidal proteins. 

F or example, by fusing cryIA<b> with the pollen and PEP 
carboxylase promoters, one would obtain expression of this gene 
in green tissues and pollen. Fusing a pith-preferred promoter 
with the crylB delta endotoxin from Bacillus thurin^iensis 
would produce expression of this insecticidal protein most 
abundantly in the-pith of a transformed plant, but not in seed 
tissues. Transformation of a plant with three genes, PEP 

T *m» oollen/crvIAtb) , and pith/cry IB produces 
carboxylase /cry IA(b) , polien/cryiAwi # 

, • -i nsecticidal endotoxins in 

a plant expressing two different Bt insecticia 

different tissues of the same plant. CrylA(b) would be 
expressed in the -outside" tissues of a plant (particularly 
maize) , that is, in those tissues which European corn borer 
feeds on first after hatching. Should ECB prove resistant to 
crylA(b) and be able to burrow into the stalk of the plant 
after feeding on leaf tissue and/or pollen, it would then 
encounter the crylB delta-endotoxin and be exposed to a second 
insecticidal component. In this manner, one can differentially 
express two different insecticdal components in the same plant 
and decrease the total number of genes necessary to introduce 
as a single genetic unit while at the same time providing 
protection against development of resistance to a single 

insecticidal component- 

Likewise, a plant may be transformed with constructs 
encoding more than one type of insecticidal protein to control 
various insects. Thus, a number of variations may be 
constructed by one of skill in the art. 
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The recombinant DNA molecules of the invention may be 
prepared by manipulating the various elements to place them in 
proper orientation. Thus, adapters or linkers may be employed 
to join the DNA fragments. Other manipulations may be 
performed to provide for convenient restriction sites, removal 
of restriction sites or superfluous DNA. These manipulations 
can be performed by art-recognized methods. See, Sambrook et 
al., Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory Press, second edition, 1989. For example, 
methods such as restriction, chewing back or filling in 
overhangs to provide blunt ends, ligation of linkers, 
complementary ends of the DNA fragments can be provided for 
joining and ligation. See , Sambrook et al., supra. 

Other functional DNA sequences may be included in the 
recombinant DNA molecule, depending upon the way in which the 
molecule is to be incorporated into the target plant genome. 
For instance, in the case of Aorobacterium-mediated 
transformation, if Ti- or the Ri- plasmid is used to transform 
the plant cells, the right and left borders of the T-DNA of the 
Ti- and Ri- plasmid will be joined as flanking regions to the 
expression cassette. Aarobacterium tumef aciens-mediated 
transformation of plants has been described in Horsch et al., 
Science , 225:1229 (1985); Marton, Cell Culture Somatic Cell 
Genetics of Plants , 1:514-521 (1984); Hoekema, Inj. The Binary 
Plant Vector System Of f set-Drukkeri j Kanters B.V., 
Alblasserdam, 1985, Chapter V Fraley, et al., Crit. Rev. Plant 
Sci., 4:1-46; and An et al., EMBO J. , 4:277-284 (1985). 
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The recombinant DNA molecules of the invention also can 
include a marker gene to facilitate selection in recombinant 
plant cells. Examples of markers include resistance to a 
biocide such as an antibiotic, e.g. kanamycin, hygromycin, 
chloramphenicol, paramomycin, methotrexate and bleomycin, or a 
herbicide such as imidazoles, sulfonylureas, glyphosate, 
phosphinothricin, or bialaphos. Marker genes are well known in 
the art. 

in another embodiment of the present invention, 
plants stably transformed with a recombinant DNA molecule or 
chimeric gene as described hereinabove are provided. The 
resultant transgenic plant contains the transformed gene stably 
incorporated into its genome, and will express the structural 
gene operably associated to the promoter in the respective 
fashion- 

Transgenic plants encompassed by the instant invention 
include both monocots and dicots . Representative examples 
include maize, tobacco, tomato, cotton, rape seed, soybean, 
wheat, rice, alfalfa, potato and' sunflower . [others?]. 
Preferred plants include maize, particularly inbred maize 



\ 



All transformed plants encompassed by the instant 
invention may be prepared by several methods known in the art. 
» tumefaciens -mediated transformation has been disclosed 
above. Other methods include direct gene transfer into 
protoplasts, Paszkowski et al., EMBO_^, 12:2717 (1984); Loerz 
et al., MQj ««n. £ Genet. , 1199:178 (1985); Fromm et al., 
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Nature 319:719 (1986); raicropro jectile bombardment: , Klein et 
al., Bio/Technology , 6:559-563 (1988); injection into 
protoplasts, cultured cells and tissues, Reich et al., 
Bio/Technology , 4:1001-1004 (1986); or injection into 
meristematic tissues or seedlings and plants as described by De 
La Pena et al., Nature , 325:274-276 (1987); Graves et al., 
Plant Mol. Biol. , 7:43-50 (1986); Hooykaas-Van Slogteren et 
al., Nature , 311:763-764 (1984); Grimsley et al . , 
Bio/Technology , 6:185 (1988); and Grimsley et al., Nature, 
325:177 (1988); and electroporation, WO92/09696. 

The expression pattern of a structural gene operatively 
associated with an instant tissue-preferred or tissue-specific 
promoter in a transformed plant containing the same is critical 
in the case where the structural gene encodes an insecticidal 
protein. For example, the instantly disclosed pith-preferred 
expression pattern will allow the transgenic plant to tolerate 
and withstand pathogens and herbivores that attack primarily 
the pith, but also the brace roots, outer sheath and leaves of 
the plant since the protein will be expressed to a lesser 
extent but still in an insect controlling amount in these plant 
parts, but yet in the case of both types of promoters, will 
leave the seed of the plant unaffected. 

EXAMPLES 

The following examples further describe the materials 
and methods used in carrying out the invention. They are 
offered by way of illustration, and not by way of limitation. 
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EXAMPLE 1: General Methods 

DNA manipulations were done using procedures that are 
standard in the art. These procedures can often be modified 
and/or substituted without substantively changing the result. 
Except where other references are identified, most of these 
procedures are described in Sambroo* et al . , Molecular Clon.n 
a Laboratory Manual , Cold Spring Harbor Laboratory Press, 
second edition, 1989. 

Synthesis of DNA oligomers: 

DNA oligomers which are from about twenty to about 
ninety, preferably from about sixty to about eighty nucleotides 
in length, are synthesized using an Applied Biosystems 
model 380B DNA synthesizer and standard procedures. The 
oligomers are made using the updated SSCAF3 cycle on a 0.2 
umole, wide pore, small scale ABI column. The end procedure is 
run trityl off and the oligomer is cleaved from the column 
using the 380B's automatic cleavage cycle. The oligomers are 
then deblocked in excess ammonium hydroxide (NH^OH) at 55'C for 
8-12 hours. The oligomers are then dried in an evaporator 
using nitrogen gas. After completion, the oligomers are 
resuspended in 0.25 - 0.5 ml of deionized water. 

Purification of synthetic oligomers: 

An aliquot of each oligomer is mixed with an equal 
volume of blue dye\formamide mix with the final solution 
containing 0.05% bromophenol blue, 0.05* xylene cyanol FF, and 
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25 % formamide. This mixture is heated at 95 -C for 10 minutes 
to denature the oligomers. Samples are then applied to a 12 % 
polyacrylamide-urea gel containing 7 M urea (Sambrook et al . ) . 
After electrophoresis at 300-400 volts for 3-4 hours using a 
Vertical Slab Gel Unit (Hoefer Scientific Instruments, San 
Francisco, CA) , UV shadowing is used to locate the correct 
sized fragment in the gel which was then excised using a razor 
blade. The purified gel fragment is minced and incubated in 
0.4 M LiCl, 1 mM EDTA <pH 8) buffer overnight at 37»C. 

Either of two methods is used to separate the oligomers 
from the polyacrylamide gel remnants: Gene\X 25 uM porous 
polyethylene filter units or Millipore' s ultrafree-MC 0.45 uM 
filter units. The purified oligomers are ethanol precipitated, 
recovered by centrifuging in a microfuge for 20 min at 4«C, and 
finally resuspended in TE (10 mM Tris, 1 mM EDTA, pH 8.0) . 
Concentrations are adjusted to 50 ng\yl based on absorption 
readings at 260 run. 

Kinasing oligomers for size determinations: 

To check the size of some of the oligomers on a 
sequencing gel, kinase labeling reactions are carried out using 
purified synthetic oligomers of each representative size: 
40mers, SOmers, 70mers, 80mers, and 90mers. In each 20 pi 
kinasing reaction, one pmole of purified oligomer is used in a 
buffer of 7.0 mM Tris pH 7.5, 10 mM KC1, 1 mM MgC12) , 0.5 mM 
DTT, 50 ug/ml BSA, 3000 uCi (3 pmoles) of 32P-gammaATP, and 
8 units of T4 polynucleotide kinase. The kinase reaction is 
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incused for 1 hour at 37-C, followed by a phenol Chloroform 
extraction and three ethanol precipitations with glycogen as 
carrier (Tracy, »~ P - Biochem. 11:251-268 (1981) . 

Two gel loadings (one containing 1000 cpm, the other 
containing 2000 cpm) of each reaction are prepared with 25% 
f oroide, 0.05* bromophenol blue, and 0.05% xylene cyanol FF. 
The xinased oligomers are boiled for 5 minutes before loadxng 
on a 6 % polyacrylamide, 7 M urea sequencing gel (BBL Gel 
Mix TM6, BRL, Gaithersburg, MD, . A sequencing reaction of 
plasmid PUC18 is run on the same gel to provide size markers. 

After electrophoresis, the gel is dried and exposed to 
diagnostic X-ray film (Kodak, X-OMAT AR> . The resulting 
autoradiograph shows all purified oligomers tested to be of the 
correct size. Oligomers which had not been sized directly on 
the sequencing gel are run on a 6 % polyacrylamide, 7 M urea 
gel (BRL Gel Mix TM6) , using the sized oligomers as size 
.arkers. All oligomers are denatured first with 25 % formamide 
at 100-C for 5 minutes before loading on the gel. Ethidxum 
bromide staining of the polyacrylamide gel allows all the 
oligomers to be visualized for size determination. 

Hybridizing oligomers for direct cloning: 

Oligomers to be hybridized are pooled together (from 1 
ug to 20 ug total DNA) and kinased at 37-C for 1 hour in IX 
Promega ligation buffer containing 30 mM Tris-HCl P H 7.8, 10 mM 
MgC12, 10 mM DTT, and 1 mM dATP . One to 20 units of T4 
polynucleotide kinase is used in the reaction, depending on the 
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amount of total DNA present. The kinasing reactions are 
stopped by placing the reaction in a boiling water bath for 
five minutes. Oligomers to form the 5' termini of the 
hybridized molecules are not kinased but are added to the 
kinased oligomers along with additional hybridization buffer 
after heating. The pooled oligomers are in a volume of 50-100 
ul with added hybridization buffer used to adjust the final 
salt conditions to 100 mM NaCl, 120 mM Tris pH 7.5, and 10 mM 
MgC12. The kinased and non-kinased oligomers are pooled 
together and heated in a boiling water bath for five minutes 
and allowed to slowly cool to room temperature over a period of 
about four hours. The hybridized oligomers are then 
phenol \chloroform extracted, ethanol precipitated, and 
resuspended in 17 ul of TE (10 mM Tris, 1 mM EDTA, pH 8.0) . 
Using this 17 ul, a ligation reaction with a final volume of 20 
pi is assembled (final conditions - 30 mM Tris-HGl pH 7.8, 10 
mM MgC12, 10 mM DTT, 1 mM ATP, and 3 units of T4 DNA ligase 
(Promega, Madison WI) . The ligation is allowed to incubate for 
about 2 hours at room temperature. The hybrid! zedUigated 
fragments are generally purified on 2% Nusieve gels before 
and\or after cutting with restriction enzymes prior to cloning 
into vectors. A 20 ul volume ligation reaction is assembled 
using 100 ng to 500 ng of each fragment with approximate 
eguimolar amounts of DNA in 30 mM Tris-HCl pH 7.8, 10 mM MgC12, 
10 mM DTT, 1 mM ATP, and 3 units of T4 DNA ligase (Promega, 
Madison, WI) . Ligations are incubated at room temperature for 
2 hours. After ligation, DNA is transformed into frozen 
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content coli cells using standard procedures (Sambrook et 
al.) and transf ormants are selected on LB-agar (Sambrook et 
al.) containing 100 pg/ml ampicillin (see below). 

PCR Reactions for Screening clones in K. coli: 

g. coli colonies which contain the correct DMA insert 
are identified using PCR (see generally, Sandhu et al., 
RJoTechnioues 7:689-690 (1989)). Using a toothpick, colonies 
are scraped from an overnight plate and added to a 20 pi to 45 
U l PCR reaction mix containing about 50 pmoles of each 
hybridizing primer (see example using primers MK23A28 and 
MK25A28 to select orientation of SacII fragment in P HYB2#6) , 
200 ym to 400 mM of each dNTP, and IX reaction buffer (Perkin 
Elmer Cetus, Norwalk, CT) . After boiling the coli\PCR mix 
in a boiling water bath for 10 minutes, 5 ul of Taq polymerase 
(0.5 units) (Perkin Elmer Cetus, Norwalk, Conn.) in IX reaction 
buffer is added. The PCR reaction parameters are generally sei 
with a denaturing step of 94-C for 30 seconds, annealing at 
55.C for 45 seconds, and extension at 72-C for 45 seconds for 
30 to 36 cycles. PCR reaction products are run on agarose or 
Nusieve agarose (PMC) gels to detect the correct fragment size 
amplified. 



Ligations : 

Restriction enzyme digested fragments are either 
purified in 1% LGT (low gelling temperature agarose, FMC) , 2% 
Nusieve (FMC), or 0.75% agarose using techniques standard in 
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the art. DNA bands are visualized with ethidium bromide and 
bands are recovered from gels by excision with a razor blade. 
Fragments isolated from LGT are ligated directly in the LGT. 
Ten microliters of each recovered DNA fragment is used to 
assemble the ligation reactions, producing final ligation 
reaction volumes of about 23 ul. After excision with a razor 
blade, the recovered gel bands containing the desired DNA 
fragments are melted and brought to IX ligase buffer and 3 
units of T4 DNA ligase (Promega) are added as described above. 
Fragments isolated from either regular agarose or Nusieye 
agarose are purified from the agarose using ultrafree-MC 0.45 

■n 

pM filter units (Millipore) and the fragments are ligated as 
described above. Ligation reactions are incubated at room 
temperature for two hours before transforming into frozen 
competent E. coli cells using standard procedures (Sambrook et 
al . ) . 



Transformations : 

Frozen competent coli cells of the strain DHSalpha 
or HB101 are prepared and transformed using standard procedures 
(Sambrook et al.). E. Coli "SURE" competent cells are obtained 
from Stratagene (La Jolla, CA) . For ligations carried out in 
LGT agarose, after ligation reactions are complete, 50 mM CaC12 
is added to a final volume of about 150 pi and the solution 
heated at approximately 65 »C for about 10 minutes to completely 
melt the agarose. The solution is then mixed and chilled on 
ice for about 10 minutes before the addition of about 200 pi of 
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competent cells which had been thawed on ice. This mixture is 
allowed to incubate for 30 minutes on ice. The mixture is next 
fceat shocked at 42-C for 60 seconds before chilling on ice for 
two minutes. Next, 800 ul of SOC media (20% tryptone, 0.5% 
yeast extract, 10 mM Nad, 2.5 mM KC1, adjusted to P H 8 with 5 
N NaOH, 20 mM MgC12:MgS04 mix, and 20 mM glucose; Sambrook et 
al.) is added and the cells are incubated at 37-C with shaking 
for about one hour before plating on selective media plates . 
Plates typically are L-agar (Sambrook et al.) containing 100 

yg/ml ampicillin. 

When ligations are carried out in a solution without 
agarose, typically 200 ul of frozen competent E, coli cells 
(strain DH5alpha (BRL, Gaithersburg, MD or Sure cells, 
Stratagene, La Jolla, CA) are thawed on ice and 5 ul of the 
ligation mixture added. The reaction is incubated on ice for 
about 45 to 60 minutes, the cells are then heat shocked at 42- 
for about 90 seconds. After recovery at room temperature for 
about 10 minutes, 800 ul of SOC medium is added and the cells 
are then incubated 1 hour at 37-C with shaking and plated as 



When screening for inserts into the beta-galactosidase 
gene in some of the standard vectors used, 200 ul of the 
recovered transformation mixture is plated on LB-agar plates 
containing 0.008% X-gal, 80 uM XPTG, and 100 ug/ml ampicillin 
(Sambrook et al.) . The plates are incubated at 37- overnight 
to allow selection and growth of transf ormants . 
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Miniscreening DNA: 

Trans fonnants from the selective media plates are grown 
and their plasmid structure is examined and confirmed using 
standard plasmid mini-screen procedures (Sambrook et al.) - 
Typically, the "boiling" procedure is used to produce small 
amounts of plasmid DNA for analysis (Sambrook et al . ) . 
Alternatively, an ammonium acetate procedure is used in some 
cases. This procedure is a modification of that reported by 
Shing-yi Lee et al., Biotechnigues 9:676-679 (1990). 

1) Inoculate a single bacterial colony from the 
overnight selection plates into 5 ml (can be scaled down to 1 
ml) of TB (Sambrook et al.) medium and grow in the presence of 
the appropriate antibiotic. 

2) Incubate on a roller at 37 # C overnight. 

3) Collect 5 ml of bacterial cells in a plastic 
Oakridge tube and spin for 5 min. at 5000 rpm in a Sorvall 
SS-34 rotor at 4»C. 

4) Remove the supernatant. 

5) Resuspend the pellet in 1 ml of lysis buffer (50 mM 
glucose, 25 mM Tris-HCl[pH 8.0], 10 mM EDTA and 5 mg/ml 
lysozyme) , vortex for 5 seconds, and incubate at room 
temperature for 5 min. 

6) Add 2 ml of freshly prepared alkaline solution (0.2 
N NaOH, 1% sodium dodecyl sulfate), tightly secure lid, mix by 
inverting 5 times and place tube in an ice-water bath for 5 
min. 

7) Add 1.5 ml of ice-cold 7.5 M ammonium acetate (pH 
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7.6) to the solution, mix by inverting the tube gently 5 times 
and place on an ice-water bath for 5 min. 

8) Centrifuge mixture at 9000 rpm for 10 min. at room 

temperature . 

9) Transfer clear supernatant to a 15 ml Corex tube and 
add 0.6 volumes of isopropanol (approx. 2.5 ml). Let sit at 

room temperature for 10 min. 

10) Centrifuge the mixture at 9000 rpm for 10 min. at 

room temperature and discard the supernatant. 

11) Resuspend the pellet in 300 ul of TE buffer. Add 6 
ul of a stock of RNase A t Tl (made as a 200 ul solution by 
adding 180 ul of RNase A [3254 Units/mg protein, 5.6 mg 
protein/ml] and 20 ul of RNase Tl[481 Units/ug protein, 1.2 mg 
protein/ml] ) . These stocks may be purchased from USB (US 
Biochemical) . Transfer to a microcentrifuge tube and incubate 

at 37 »C for 15 min. 

12) Add 75 ul of distilled water and 100 ul of 7.5 M 
ammonium acetate and incubate in an ice-water bath for 10 min. 

13) Centrifuge the mixture at 14,000 rpm for 10 min. in 

a Beckman microfuge at 4»C. 

14) Precipitate by adding 2.5 volumes of 100% EtOH 
(approx. 1 ml) and incubate in an ice-water bath for 10 min. 

15) Spin at 14,000 rpm for 10 min. in a microfuge. 

16) wash pellet with 70% ethanol (using 0.5 ml-1 ml) . 
Dry the pellet and resuspend in 100 ul of IX New England 
Biolabs restriction enzyme Buffer 4 [20 mM Tris-HCKpH 7.9), 10 
mM magnesium acetate, 50 mM potassium acetate, 1 mM DTT] . 
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Measure concentration and check purity by spectrophotometry at 

absorbances 260 and 280 nm. 

For a more rapid determination as to whether or not a 
particular bacterial colony harbored a recombinant plasmid, a 
PCR miniscreen procedure is carried out using a modification of 
the method described by (Sandhu, G.S. et al., 1989, 
BioTechniques, 7:689-690). Briefly, the following mixture is 
prepared: 

100 pi primer mix above, 20 yM each primer, 
100 ul dNTP mix (2.5 mM each) 

100 ul 10X AmpliTaq buffer (Perkin-Elmer Cetus, IX 
buffer - 10 mM Tris-HCl pH 8.3, 50 mM KC1, 1.5 mM MgC12, and 

0.01% gelatin) 

700 ul deionized water. 

20 \xl of the above mixture is put into a a 0.5 ml 
polyproplyene PCR tube. A transformed bacterial colony is 
picked with a toothpick and resuspended in the mixture. The 
tube is put in a boiling water bath for 10 minutes and then 
cooled to room temperature before adding 5 ul of the mix 

described below: 

265 ul deionized water 

30 ul 10X Amplitaq buffer (Perkin-Elmer Cetus, IX 
buffer = 10 mM Tris-HCl pH 8 . 3, 50 mM KC1, 1.5 mM MgC12, and 
0.01% gelatin) 

7.5 ul Tag polymerase 
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^ ^oriAid with 50 ul of mineral oil and 
The samples are overlaid wicn 

PGR is carried out for 30 cycles using the following 



denature: 94 • for 1 min 
anneal: 55 • for 1 min 
extend: 72* for 45 seconds. 

After PCR amplification, 1 pi of loading dye (30% 
glycerol, 0.25% Bromophenol blue, 0.25% xylene cyanol) is added 
to the whole reaction and 20 ul of the mixture is loaded on a 
2% Nusieve, 1% agarose gel to see if there is a PCR product of 

the expected size. 

This procedure is used as an initial screen. Minipreps 
are subsequently carried out to confirm the structure of the 
plasmid and its insert prior to sequencing. 

*v*mpt.E 21 Ag g LEFICATION AND ASSEMBLY OF EACH 9™™ 
Cloning fragments of the synthetic Bt crylA(b) gene. 

The synthetic gene was designed to be cloned in four 
pieces, each roughly one quarter of the gene. The oligomers 
for each quarter were pooled to either be assembled by PCR, 
hybridization, or a combination of hybridization followed by 
PGR amplification as described elsewhere. Synthetic quarters 
were pieced together with overlapping restriction sites Aat II, 
Ncol, and Apa I between the 1st and 2nd, 2nd and 3rd, and 3rd 
and 4th quarters respectively. 

Each quarter of the gene (representing about 500 bp) 
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was assembled by hybridizing the appropriate oligomers and 
amplifying the desired fragment using PCR primers specific for 
the ends of that quarter. Two different sets of PCR reactions 
employing two sets of slightly different primers were used. 
The PCR products of the two reactions were designed to be 
identical except that in the first reaction there was an 
additional AATT sequence at the 5' end of the coding region and 
in the second reaction there was an AGCT sequence at the 3' end 
of a given quarter. When the products of the two reactions for 
a particular quarter were mixed (after removing the polymerase, 
primers and incomplete products) , denatured, and subsequently 
re-annealed, a certain ratio (theoretically 50%) of the 
annealed product should have non-homologous overhanging ends. 
These ends were designed to correspond to the "sticky ends" 
formed during restriction digestion with BcoRI at the 5' end 
and Hind III at the 3' end of the molecule. The resulting 
molecules were phosphorylated, ligated into an EcoRI/Hindlll 
digested and phosphatased Bluescript vector, and transformed 
into frozen competent coli strain DHSalpha. After 
selection, the coli colonies containing the desired fragment 
are identified by restriction digest patterns of the DNA. 
Inserts representing parts of the synthetic gene are 
subsequently purified and sequenced using standard procedures. 
In all cases, clones from multiple PCR reactions are generated 
and sequenced. The quarters are then joined together using the 
unique restriction sites at the junctions to obtain the 
complete gene . 
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Cloned quarters are identified by mini-screen 
procedures and the gene fragment sequenced. It is found that 
errors are frequently introduced into the sequence, most 
probably during the PGR amplification steps. To correct such 
errors in clones that contain only a few such errors, 
hybridized oligomers are used. Hybridized fragments are 
digested at restriction enzyme recognition sites within the 
fragment and cloned to replace the mutated region in the 
synthetic gene. Hybridized fragments range from 90 bp in 
length (e.g. the region that replaces the fragment between the 
Sac II sites in the 2nd quarter) to the about 350 bp 4th 
quarter fragment that replaces two PGR induced mutations in the 

4th quarter of the gene. 

Due to the high error rate of PCR, a plasmid is 
designed and constructed which allows the selection of a cloned 
gene fragment that contains an open reading frame. This 
plasmid is designed in such a manner that if an open reading 
frame is introduced into the cloning sites, the transformed 
bacteria could grow in the presence of kanamycin. The 
construction of this vector is described in detail below. This 
selection system greatly expedites the progress by allowing one 
to rapidly identify clones with open reading frames without 
having to sequence a large number of independent clones . The 
synthetic quarters are assembled in various plasmids, including 
BSSK (Stratagene; La Jolla, Ca) , P UC18 (Sambrook «•!.), and 
the Km-expression vector. Other suitable plasmids, including 
PUC based plasmids, are known in the art and may also be used. 
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Complete sequencing of cloned fragments, western blot analysis 
of cloned gene products, and insect bioassays using European 
corn borer as the test insect verify that fully functional 
synthetic Bt crylA(b) genes have been obtained. 

Construction of the Km-expression vector to select open reading 



The Km-expression vector is designed to select for fragments of 
the synthetic gene which contain open-reading frames. PCR 
oligomers are designed which allow the fusion of the NPTII gene 
from Tn5 starting at nucleotide 13 (Reiss et al., EMBO J- 
3:3317-3322 (1984)) with pUC18 and introduce useful restriction 
sites between the DNA segments. The polylinJcer region contains 
restriction sites to allow cloning various synthetic Bt IP 
fragments in-frame with the Km gene. The 88 bp 5' oligomer 
containing the polylinker region is purified on a 6% 
polyacrylamide gel as described above for the oligomer PAGE 
purification. A PCR reaction is assembled with a 1 Kb Bgl 
H\Sma I template fragment which contains the NPT II gene 
derived from Tn5. The PCR reaction mix contains 100 ng of 
template with 100 pmols of oligomers KE72A28 and KE74A28 (see 
sequences below), 200 nM dNTP, and 2.5 Units of Taq polymerase 
all in a 50 ul volume with an equal volume of mineral oil 
overlaid. Sequences of the primers are: 



KE74A28 

5 ' -GCAGATCTGG ATCCATGCAC GCCGTGAAGG GCCCTTCTAG AAGGCCTATC 
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GATAAAGAGC TCCCCGGGGA TGGATTGCAG GCAGGTTC-3' 



KE72A28 

5 ' -GCGTTAACAT GTCGACTCAG AAGAACTCGT CAAGAAGGCG-3 ' 

The PGR parameters used are: 94-C for 45 seconds (sec), 
55-C for 45 sec, and 72-C for 55 sec with the extension at step 
3 for 3 sec for 20 cycles. All PGR reactions are carried out 
in a Perkin-Elmer Cetus thermocycler . The amplified PGR 
product is 800 bp and contains the polylinJcer region with a 
translational start site followed by unique restriction sites 
fused in-frame with the Km gene from base #13 running through 
the translational terminator. pUC:KM74 is the Km-expression 
cassette that was assembled from the 800 bp Bgl II\Sal I 
polylinker/Km fragment cloned in the PUC18 vector. The lacZ 
promoter allows the Km gene to be expressed in ^ coli . 
PUC.KM74 derivatives has to first be plated on LB-agar plates 
containing 100 pg/ml ampicillin to select transformants which 
can subsequently be screened on LB-agar plates containing 25 
M/.L fcanamycin/IPTG. Synthetic Bt IP gene fragments are 
assembled from each quarter in the Km-cassette to verify 
cloning of open-reading- frame containing fragments pieces. The 
first ECB active synthetic Bt IP gene fragment, P Bt:Km#6, is a 
Bt IP gene that shows Km resistance. This fragment is 
subsequently discovered to contain mutations in the 3rd and 4th 
quarter which are later repaired. 
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EXAMPLE 2A: SYNTHESIS AND CLONING OF THE FIRST QUARTER OF THE 

SYNTHETIC GENE fbase pair s 1 to 5501 

The following procedures are followed in order to clone 
the first quarter of the synthetic DNA sequence encoding^ a 
synthetic Bt crylA(b) gene* The same procedures are 
essentially followed for synthesis and cloning of the other 
quarters, except as noted for primers and restriction sites. 

Template for Quarter 1: Mixture of equal amounts of purified 
oligomers U1-U7 and LI to L7 



PCR Primers: 
Forward: 

PI (a): 5' -GTCGACAAGG ATCCAACAAT GG-3' 

PI (b) : 5' -AATTGTCGAC AAGGATCCAA CAATGG-3' 



Reverse : 

P2 (a): 5'-ACACGCTGAC GTCGCGCAGC ACG-3' 
P2 (b) : 5'-AGCTACACGC TGACGTCGCG CAG-3' 

Primer pair Al : Pl(b) + P2 (a) 
Primer pair A2 : PI (a) + P2 (b) 

The PCR reaction containing the oligomers comprising 
the first quarter of the synthetic maize-optimized Bt IP gene 
is set up as follows: 
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200 ng oligo mix (all oligos for the quarter mixed in 

equal amounts based on weight) 

10 ul of primer mix (1:1 mix of each at 20 uM; 

primers are described above) 

5 ul of 10X PCR buffer 

PCR buffer used may be either 

(a) IX concentration = 10 mM KC1, 10 mM <NH4)2S04, 20 
n* Tris-HCl, P H 8.0, 2 mM MgS04, and 0.1% Triton X-100) , or 

(b) IX concentration = 10mM Tris-HCl P H 8.3, 50 mM KCl 
1.5mM MgCi 2 , 0.01% wt/vol gelatin. 

Components are mixed, heated in a boiling water bath 
for 5 minutes, and incubated at 65 «C for 10 minutes. 
Next, the following reagents are added: 
8 ul of dNTPs mixture (final concentration in the 

reaction = 0.2 mM each) 

5 units polymerase. 

The final reaction volume is 50 microliters. 

Oligomers are then incubated for 3 min at 72-C and then 
a PCR cycle is run. The PCR reaction is run in a Perxin Elmer 
thermocycler on a step cycle protocol as follows: 

denaturation cycle : 94* for 1 minute 

annealing cycle : 60* for 1 minute 

extension cycle : 72- for 45 seconds ( + 3 sec per 



cycle) 



number of cycles: I 5 
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After the reaction is complete , 10 pi of the PCR 
reaction is loaded on a 2% Nusieve-GTG (FMC) , 1% agarose 
analytical gel to monitor the reaction. The remaining 40 pi is 
used to clone the gene fragments as described below. 

PCR Products 

The termini of the double stranded PCR product 
corresponding to the various primer pairs are shown (only upper 
strand) : 

Al AATTGTCGAC GCGTGT (554 bp) first qtr. 

A2 GTCGAC_ GCGTGT AGCT (554 bp) first qtr. 

Hybridization 

40 pi of each of the PCR reactions described above is 
purified using a chromaspin 400 column (Clonetech, Palo Alto, 
CA) according to manufacturers directions. Five pg of carrier 
DNA was added to the reactions before loading on the column. 
(This is done for most of the cloning. However, in some 
reactions the PCR reactions are phenol: chloroform extracted 
using standard procedures (Sambroofc et al.) to remove the Taq 
polymerase and the PCR generated DNA is recovered from the 
aqueous phase using a standard ethanol precipitation 
procedure.) The carrier DNA does not elute with the PCR 
generated fragments . The Al and A2 reaction counterparts for 
each quarter are mixed, heated in a boiling water bath for 10 
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^es and ^ incubated a, S5-C ove^t- - "actions 
are then removed from the 65- bath and ethanol predated 
with 1 Ul (20 ug> of nuclease free glycogen (Tracy, Preo. 
Biochenu 11:251-268 (1961) as carrier. The pellet is 
resuspended in 40 pi of deionized water. 

Phosphorylation reaction 

The phosphorylation reaction is carried out as follows: 

40 ul DNA 

2.5 Hi 20 mM ATP 

0.5 Ul 10X BSA/DTT (IX = 5 mM DTT, 0.5 mg/ml BSA) 
1.0 ul 10X polynucleotide kinase buffer (IX = 70 



mM Tris.HCl, 



pH 7.6, 0.1 M KC1, 10 mM MgC12) 

2.0 ul polynucleotide kinase (New England Biolabs, 



20 units) 



Incubation is for 2 hours at 37 •C. 

4-k«« Dvfracted one time with a 1:1 
The reaction xs then extracted 

phenol: chloroform mixture, then once with chloroform and the 
aq ueous Phase ethanol precipitated using standard procedures. 
The pellet is resuspended in 10 ul of TE. 



Restriction Digests 
20 ug 



of Bluescript vector (BSSK+, Stratagene, La 



Jolla, CA) 

, n v = 20 mM Tris-HCl pH 

10 ul 10 X restriction buffer (IX ^ ™" 
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8.0, 10 mM MgC12, 100 mM NaCl) 

5 ul Eco RI (New England Biolabs) 100 units 
5 ul Hind III (New England Biolabs) 100 units 
Final reaction volume is 100 ul. 
Incubation is for 3 hours at 37 
When completed, the reaction is extracted with an equal volume 
of phenol saturated with TE (10 mM Tris.HCl pH 8.0 and 10 mM 
EDTA) . After centrifugation, the aqueous phase was extracted 
with an equal volume of 1 : 1 mixture of (TE saturated) 
phenol: chloroform (the "chloroform" is mixed in a ratio of 24:1 
chloroform :isoamyl alcohol), and finally the aqueous phase from 
this extraction is extracted with an equal volume of 
chloroform. The final aqueous phase is ethanol precipitated 
(by adding 10 ul of 3 M sodium acetate and 250 ul of absolute 
ethanol, left at 4» for 10 min and centrifuged in a microfuge 
at maximum speed for 10 minutes. The pellet is rinsed in 70% 
ethanol and dried at room temperature for 5-10 minutes and 
resuspended in 100 ul of 10 mM Tris.HCl (pH 8.3). 

Phosphatase reaction 

Vector DNA is routinely treated with phosphatase to 
reduce the number of colonies obtained without an insert . Calf 
intestinal alkaline phosphatase is typically used (Sambrook et 
al.), but other phosphatase enzymes can also be used for this 
step. 

Typical phosphatase reactions are set up as below: 
90 ul of digested DNA described above 
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10 Ul of 10X Calf intestinal alkaline phosphatase 

_ M „ . Hr1 , nH 83) , 10 mM MgC12, 1 mM ZnC12, 10 
buffer (1X=50 mM Tris-HCl (pH B.JJf 

mM spermidine) 

1 U l (1 unit) of calf intestinal alkaline phosphatase 

(CIP, Boehringer Mannheim, Indianapolis, IN) 

incubation is at 37 «C for 1 hour. 

The DNA is then gel purified (on a 1% low gelling 
temperature (LGT) agarose gel, and the pellet resuspended in 50 
U l TE After electrophoresis, the appropriate band is excised 
from the gel using a razor blade, melted at 65- for 5 minutes 
and diluted 1:1 with TE. This solution is extracted twice with 
phenol, once with the above phenol chloroform mixture, and once 
with chloroform. The final aqueous phase is ethanol 
precipitated and resuspended in TE buffer. 

Ligation : 

To ligate fragments of the synthetic gene into vectors, 
the following conditions are typically used. 

5 ul ° f phosphorylated insert DNA 

2 ul of phosphatased Eco RI/Hind III digested 
Bluescript vector heated at 65- for 5 minutes, then cooled 

1 ul 10X ligase buffer (IX buffer=30 mM Tris.HCl ( P H 
7.8), 10 mM MgC12 r 10 mM DTT, 1 mM ATP) 

1 ul B SA < X m 9/ m l> 

1 ul Ugase (3 units, Promega, Madison, Wise.) 
Ligase reactions are typically incubated at 16* 
overnight or at room temperature for two hours. 
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Transformation : 

Transformation of ligated DNA fragments into coli is 
performed using standard procedures (Sambrook et al.) as 

described above. 
Identification of recombinants 

White or light blue colonies resulting from overnight 
incubation of transformation plates are selected. Plasmids in 
the transformants are characterized using standard mini-screen 
procedures (Sambrook et al.) or as described above. One of the 
three procedures listed below are typically employed: 

(1) boiling DNA miniprep method 

(2) PCR miniscreen 

(3) Ammonium acetate miniprep. 

The restriction digest of recombinant plasmids believed 
to contain the first quarter is set up as follows: 

(a) Bam HI/Aat II digest: 10 pi DNA + 10 pi IX New 
England Biolabs restriction enzyme Buffer 4 

0.5 pi Bam HI (10 units) 

0.5 pi Aat II (5 units) 

Incubation is for about 2 hours at 31 9 C. 

Clones identified as having the desired restriction 
pattern are next digested with Pvu II and with Bgl II in 
separate reactions. Only clones with the desired restriction 
patterns with all three enzyme digestions are carried further 
for sequencing . 

Sequencing of cloned gene fragments: 
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sequencing is performed using a modification of 
Sanger's dideoxy chain termination method (Sambrook et al.) 
using double stranded DNA with the Sequenase 2 kit (United 
States Biochemical Corp., Cleveland, OH). In all, six first 
guarter clones are sequenced. Of the clones sequenced, only 
two clones designated pQAl and P QA5 are found to contain only 
one deletion each. These deletions are of one base pair each 
located at position 452 in pQAl and position 297 in P QA5. 

Plasmid PQAl is used with pPl-8 (as described below) to 
obtain a first quarter with the expected sequence. 

EXAMPLE 2B: SYNTHESIS AND CLONING OF JHESECOND 

QUARTER [base pa irs 531 to maui 

Template: oligomers U8-U14 and L8-L14 

PCR Primers: 

forward: 

P3 (a): 5'-GCTGCGCGAC GTCAGCGTGT TCGG-3' 
P3 (b) : 5 ' -AATTGCTGCG CGACGTCAGC GTG-3' 



Reverse : 

P4 (a): 5 ' -GGCGTTGCCC ATGGTGCCGT ACAGG-3' 
P4 (b): 5'-AGCTGGCGT TGCCCATGGT GCCG-3' 

Primer pair Bl: P3 (b) + P4 (a) 
Primer pair B2: P3 (a) + P4 (b) 

PCR Products 
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Bl AATTGCTGCG AACGCC (524 bp) second 

quarter 



B2 



GCTGCG AACGCCAGCT (524 bp) 



Hybridization, PCR amplification, spin column size 
fractionation, and cloning of this gene fragment in Bluescript 
digested with Eco RI/Hind III are performed as described above 
for the first quarter (Example 2A) . The PCR product for this 
quarter is about 529 bp in size representing the second quarter 
of the gene (nucleotides 531 to 1050) . Transformation is into 
frozen competent E. coli cells (DH5alpha) using standard 
procedures described above (SambrooJc et al.) 

Miniscreen of pQB clones: 

Miniprep DNA is prepared as described above and 
digested with (a) Aat II/Nco I, (b) Pvu II and (c) with Bgl I 
to confirm the structure insert in the vector before 
sequencing. 

Sequencing is performed as described above using the 
dideoxy method of Sanger (SambrooJc et al.). 

A total of thirteen clones for this quarter are 
sequenced. The second quarter consistently contains one or 
more deletions between position 884 and 887. In most cases the 
G at position 884 is deleted. 

Plasmid pQB5 had only one deletion at position 884. 
This region lies between two Sac II sites (positions 859 and 
949) . correction of this deletion is described in Example 3. 
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Clones of the first half (1-1050 bp) . 

A fragment for cloning the first half (quarters 1 and 
2) of the synthetic Bt maize gene as a single DNA fragment is 
obtained by restriction digestion of the product of a PGR 
reaction comprising the first quarter and the second quarter. 
Restriction endonuclease Aat II is used to cut the DNA 
(following phenol extraction and ethanol precipitation), in a 20 
W l reaction. 15 ul of each of the Aat II digested quarters is 
mixed and ligated (in a 50 ul volume by adding 5 ul of 10X 
ligase buffer, (1X-30 mM Tris-HCl P H 7.8, 10 mM MgC12, 10 mM 
DTT, 1 mM ATP) 14 ul of deionized water and 1 Ul of T4 DNA 
ligase, 3 units, Promega, Madison, WI) at room temperature for 
2 nr. The result is an about 1 kb fragment as judged by 
electrophoresis on a 1% agarose gel run using standard 
conditions (Sambrook et al.) Ten ul of the ligation product is 
amplified by PGR using conditions described previously except 
that only 5 cycles were run. 

Primer Pair: HA- Pi (a) + P4 (b) 
Primer Pair: HB« PI (b) + P4 (a) 

The product of these reactions is cloned into 
Bluescript (Stratagene, La Jolla, CA) as described for the 
individual quarters. This procedure is only done once i.e., 
all insert DNA is obtained in a particular region from a single 
PCR reaction. 

Thirty-six colonies are miniscreened with Sal I digests 
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and Pvu II digests. All except 4 contain an insert of 
approximately 1 kb in size of which at least 20 contain the 
correct Pvu II digestion pattern. Eight of these clones are 
selected for sequence analysis. One of the clones, Pl-8, has 
the desired sequence between the Eco NI site (396 bp) and the 
Dra III site (640 bp) . This clone is used to obtain a plasmid 
with the desired sequence up to the Dra III site (640 bp) in 
the second quarter with pQAl (first quarter with a deletion at 
position 452 bp described previously.) 

EXAMPLE 2C: CLONING AND SYNTHESIS OF THIRD 

QUARTER [base pairs 1021 to 1500] 

Template: Oligos U15-U20 and L15-L21 

PCR primers: 

forward 

P5 (a): 5'-TTCCCCCTGT ACGGCACCAT GGGCAACGCC GC-3' 
P5 (b) : 5' -AATTGTACGG CACCATGGGC AAC-3' 



P6 (a): 5' -GAAGCCGGGG CCCTTCACCA CGCTGG-3' 
P6 (b) : 5' -AGCTGAAGCC GGGGCCCTTC ACC-3' 



Primer pair CI: P5 (b) + P6 (a) 
Primer pair C2: P5(a) + P6(b) 



PCR Product: 

CI AATTGTACGG G GCTTC (475 bp) 3d qtr 
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GGCTTCAGCT (4 84 bp) 3d qtr 
C2 TTCCCCTGTACGG . — 

PC* reactions, spin coll— recovery of the correct 
sired DNA f recent, and ligation into vectors are performed as 
described above (Example 2A> using a Bluescript vector cut »th 
eco RI and Bind III. The approaimately 479 base parr PC* 
product represents the third garter of the synthetic gene <NT 

1021 - 1500) . 

Transformation into frosen competent L £2i4 
DHSalpha cells, selection and identification of transf chants . 
characterisation of transforms by mini-screen procedures. 
,„d sequencing of the synthetic gene fragment in the vector are 
all as described above. 



Mini screen of pQC clones: 

The third quarter is miniscreened using standard 
procedures .Sambroo* et al... Hiniprep DHA is cut with <a, Hco 
I/Apa 1 and (b, with Pvu II . Clones containing the correct 
restriction digest patterns are 'sequenced using standard 
procedures. A total of 22 clones of the third quarter are 
sequenced. Three major deletion -hotspots- in the third 
garter are identified (a. at position 1083 (b, between 
position 1290-1397 and (c. between positions 1356-1362. In all 

~<->ch there is also consistently an 
clones except one, pQC8, tnere j-s 

insertion of a C at position 1365. In addition to these 
stations, the third quarter clones contain a large nunfcer of 
other apparently random deletions . The common factor to the 
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three mutational "hotspots" in the third quarter and the one in 
the second quarter is that these regions are all flanked on 
either side by sequences that are about 80% C+G. Other regions 
containing 5 to 9 C-Gs in a row are not affected. The 
oligomers in U15, U16, 018, U19, L15, L16, L18 and L19 are 
redesigned to reduce the C+G content in these regions. Five 
clones each from PCR reaction using the modified oligomers are 
sequenced . 

Plasmid pQCN103 has the correct sequence for the third 

quarter except for a change at position 1326. This change, 

which substitutes a G for a C, results in the substitution of 

one amino acid (leucine) for the original (phenylalanine) . 

EXAMPLE 2D: SYNTHESIS AND CLONING OF FOURTH 

QUARTER fbase pairs 1480 to 19601 

The fourth quarter of the gene is obtained from a clone 
which is originally designed to comprise the third and fourth 
quarters of the gene. The "second half of the synthetic gene 
is obtained from PCR reactions to fuse the third and fourth 
quarters. These reactions are run with PCR primers P5 (a) and 
P6(a) described above for the third quarter and primers P7 (a) 
and P8 (a) (described below) . The reverse primer is modified to 
include a Sac I site and a termination codon. Separate 
reactions for each quarter are run for 30 cycles using the 
conditions described above. The two quarters are joined 
together by overlapping PCR and subsequently digested with 
restriction enzymes Nco I and Sac I. The resulting 953 bp 
fragment is cloned directionally into P CIB3054, which has been 
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cue „ith N=o I/Sac I and treated with alxaline phosphatase . 

pCIB3054 ia constructed by inserting intron #9 of 
PEPcarboxylase (PEPC ivs .9) in the unique Hpa I site of 
pCIB246 .described in detail in Example 4, PGTB246 is cut with 
HP.I and phosphatased with CIP using standard procedures 

. ™ pepc ivs #9 is obtained by PCK using 
described in Example 2*. PEPC ivs 

pPEP-10 as the template. pPEP-10 is e genomic subclone 
containing the entire maize PEP carboxylase gene encoding the 
C Photosynthetio enryme. plus about 2.2 Kb of 5-f landing and 
I t Kb of 3'-flan*ing DNA. The 10Kb DNA is ligated in the 
HindHI Site of POC19 . (Hudspeth et al . . Pl ant Molecular , 
Bioxogy , 12= 516-589 (19S9) . The forward PGR primer used to 
obtain the PEPCivs»9 is GTACAAAAACCAGCAACTC and the reverse 

primer is CTGCACAAAGTGGAGTAGT . The PGR product is a 108 bp 

fragment containing only the PEPcarboxylase intron 19 
sequences . The PGR reaction is extracted with phenol 
„ d chloroform, eth»ol precipitated phosphorylated with 
polynucleotide xin.se and treated with T4 polymerase to fill « 
the 3' contemplated b.ae addition found in PGR products (ClarX. 

ie. QS77-9686 (1988)) using 
j.M., Nucleic Aci d Research, 16: 9677 3t>oo 

standard procedures. The fcinased fragment is blunt-end cloned 
into the Hpal site of PCIB246, using standard procedures 



Amplification and Assembly of the Fourth Quarter 
Template: U21-U26 and L22-L28 
pr.R primers 
FORWARD 
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P7 (a): 5'-TGGTGAAGGG CCCCGGCTTC ACCGG-3' 

REVERSE 

P8 (a): 5'-ATCATCGATG AGCTCCTACA CCTGATCGAT GTGGTA-3' 

PRIMER PAIR 4: P7 (a) + P8 (a) 
PRIMER PAIR 3: P5 (A) + P6(a) 

Primer pair for overlapping PGR : P7(a) + P8 (a) 
PCR Product 

fourth quarter: GGTGAA ATCAGGAGCTCATCGATGAT 

(484 bp) third quarter: TTCCCCCTGTA TTCACCGG 

(484 bp) second half: GGTGAA CATGATGAT (953 bp) 

Four positive clones are identified by plasmid 
miniscreen and are subsequently sequenced using standard 



Plasmid Bt.P2 #1 contains approximately the correct 
fourth quarter sequence except for two mutations. These 
mutations are at position 1523 (substituting an A for a G, 
resulting in an amino acid change which substitutes a His for 
an Arg) and at position 1634 (substituting a T for a C, 
resulting in an amino acid substitution of a Ser for a Thr) . 

Plasmid Bt.P2 #1 is used in the construction of 
PCIB4414 described below. (The mistakes are ultimately 
corrected by hybridizing all the oligos of the fourth quarter, 
digesting with Apa I/Bst E II and replacing that region in 
PCIB4414. Therefore, only sequences from position 1842-1960 
remain of Bt.P2#l in the final construct.) 
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r . uss&s * - " p THE M SYWTHE " C 56 - 

The synthetic maize optimised Bt cryIA<b, gene rs 

- <- _ aI cais Using the PCR technique, 
designed to be cloned in quarters. 

however, results in mutations, which in most cases are 
deletions resulting in f rameshift mutations . Plasmrds 
containing individual quarters are therefore sequenced and the 
correct parts ligated together using standard procedures. 

After obtaining first end second quarter clones with 
iln ost the desired sequence, plasmids pEBl«4 and pEBlu.5 are 
instructed to obtain the desired sequence cf the synthetic Bt 
9 ene up to the ara XXI site at the base pair position 634 .this 
station destroys the Ore XXX site, . The pBBlQ constructs are 
made by ligetin, a 3.9 Kb Eco KI\Bam HI fragment from pPX-8 
with a 400 bp fragment from pQAl. PEBlwIS has the desired 

, tMrwio*4 has a mutation at 
sequence up to the Dra III site, but P EB1Q*4 ha 

base pair position 378. 

Plasmids P1H1M4 and plHlMS are constructed to reparr 
the Dra XXX sit. in PBB10.4 and pEBXQ#5 . Plasmids pXHIMM and 
tS are made by llgating a 3.5 Kb Hco XXAat XX fragment from 
pBB lQ#4 and #5 resp«tiv=ly, with a 500 bp Hoc X\Aat II 

- „na 5 Plasmid PlHlMS contains a mutation between 
fragment from pQ85. Fiasmro 

the sac XX sites at position 084 in the second quarter of 
synthetic Bt gene. Plasmid P1B1M4 contains the edditional 
mutation as described in its precursor construct pEBlQM . 

- fht, m uescriPt vector region of 
The Sac II site in the Bluescrxpu 

„i«im4 with Not I and Sac I and 
P 1H1M4 is deleted by cutting P 1H1M4 witn wo 
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converting these sites to blunt ends using T4 DNA polymerase 
under standard conditions before ligating this 3.9 Kb fragment 
to make plHlM4~S. Deleting the Sac II site in the vector 
region allows the 90 bp Sac II fragment with the mutation at 
position 884 in the 2nd quarter of P 1H1M4~S to be removed prior 
to replacement with a 90 bp Sac II fragment. Oligomers U\L 12 
and 13 are kinased and hybridized (described above) before 
cutting with Sac II and isolating a 90 bp fragment on a 2% 
Nusieve gel. The Sac II fragment is ligated into the about 3.8 
Kb Sac II cut P1H1M4-S vector which has been phosphatased with 
CIP. The repaired Sac II construct is called pHYB2#6. The 
orientation of the Sac II fragment in P HYB2#6 is detected by 
PCR screening as described earlier using the following primers: 

MK23A28 = 5' -GGGGCTGCGGATGCTGCCCT-3' 
MK25A28 = 5' -GAGCTGACCCTGACCGTGCT-3' 
MK26A2B = 5' -CACCTGATGGACATCCTGAA- 3 ' 

Running the PCR reactions with SO pmoles of primers 
MK23A28 and MK25A28 produces an approximate 180 bp fragment, 

* 

indicating the inserted fragment bounded by the Sac II sites in 
P HYB2*6 is in the correct orientation. Using primers MK25A28 
and MK26A28 in the PCR screening acts as the negative control 
producing an approximate 180 bp fragment only in constructs 
containing the Sac II bounded fragment in the wrong 
orientation. P HYB2#6 sequence is determined using standard 

procedures . 
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nneition 378 which needed 
pHYB2#6 has one mutation at position 

«Mv-«t ouarter containing the desired 
to be repaired to obtain a first quarter 

sequence. 

Plasmid P 1HG»6 contains the desired sequence for 
entire first naif of the synthetic Bt gene. PlBG.6 » — • 
f rom a 3.4 Kb *at XXVNeo I fragment of PlHlMS,2 ligated to a 
500 bp Ut II\Hco I fragment from pHYB2«6. 

To identify ciones or partial clones of the synthetic 
gene which contain open reading frames, tbe Xanamycin selection 
vector .described above, is used. Tbe fourth quarter of the 
synthetic Bt gene is the first put into the tanamycin cassette 
pK M74-4 contains the approximately 500 bp *pa XXCla X fragment 
£ ro» plasmid BtP2 (which had been previously transformed rnto 
aam- E. coli strain <PO-100, to be able to cut with CI, I.. 
Ugated to P OC : K>n, cut with *pa XXCla X. *lasmid 
splays *anamycin resistance but is later found to contain 

••K,„r,«: 1523 and 1634 (mutations 
substitution mutations at positions 1523 an 

• section on cloning the fourth 

are described above xn the section o 

^er; they are substitutions, not deletions or insertions, . 

Tne correct first half of tbe synthetic Bt gene from 
plasmid plHG.o is i»"rted into plasmid PKH74-4. - resulting 
plasmid, called pKm!24, is made from the about ... Kb ^ - am 
HI fragment derived from pKM.4-4 ligated to 1 Kb ^ X am 

. , on, olHG#6 pK»124 shows Icanamycin resistance . This 
fragment from plHGfS. p*"* 

.. ... second, and fourth quarters of the 
plasmid contains the first, secono, 

synthetic gene forming a single open reading frame. 

T„e third quarter of the synthetic gene rs next cloned 
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into P KM124. The first functional clone, in plasmid P Bt:Km#6, 
is a functional copy of the truncated synthetic crylA(b) gene 
in the Km-cassette which displays kanamycin resistance but 
which contains deletion mutations between the third and fourth 
quarters. Plasmid pBt:Km#6 is made from the approximately 5 Kb 
Apa I\Nco I P KM124 vector fragment ligated to the approximately 
500 bp Apa I\Nco I fragment from pQCN103 (pQCN103 contains a 
mismatch mutation at position 1326 which is repaired later) . 
Contaminating nuclease activity appears to have deleted the Apa 
I site between the third and fourth quarters in P Bt:Km#6. The 
Bt gene encoded by the synthetic gene in plasmid P BT:Km#6 has 
about 50-60 % of the native proteins' activity against ECB. 
The 2 Kb Sma I\Bam HI fragment from P Bt:Km#6 is inserted into a 
35S: expression cassette to make a plasmid called 35S:Bt6. 

Two functional synthetic Bt clones, each with 
mutations, are initially obtained: plasmids P BT:KM#6 and 
pCIB4414. PCIB4414, which is 100% active in insect bioassays 
against European corn borer compared with the native gene, 
contains substitution mutations in the third and fourth 
quarters at positions 1323, 1523, and 1634. 

PCIB4414 is constructed from two plasmids, HG3.G4#18 
and 1HG which is described above. MG3.G4#18 is obtained by 
cloning* the Apa I/Kpn I fragment in plasmid Bt.P2#l into 
PQCN103 (using those same restriction sites) . This produces a 
plasmid containing the third and fourth quarters of the gene. 
The first half of the synthetic gene from plasmid 1HG is cut 
with Bam HI and Nco I and moved into MG3.G4#18 (containing the 
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Cbird ana fourth careers of the . The resultin, plas-id. 

pCIB4414 , contains a functional version of One synthetic g ene. 

being functional, the synthetic gene in this Plasa^d 
contains three errors; position 1326 <= substituted for a C, . 
position 1523 (substitute A for a G) , and at position 163* 

(substitution of a T for a C) . 

The fourth quarter in P CIB4414 is replaced with a 354 
bp fourth quarter Apa I\Bst E II fragment obtained from 
nybrxdizing, ligating, and restriction cleaving fourth quarter 
oligomers as described earlier, and isolating the fragment fro, 
a 2% Nusieve agarose gel. P CIB4408 is a synthetic Bt gene 
clone obtained by replacing the fourth quarter fragment xn 
PCIB4414 with the hybridized fourth quarter fragment. To 
insert the CaMV 35S promoter in front of the synthetic Bt gene, 

^ * _ a trh Eco NI\Kpn I fragment from plasmxd 
PCIB4406 is made from a 4 Kb Eco Nx\*.pn 

P 35SBt6 and 1.8 Kb Eco NI\K P n I fragment from P CIB4408. 

PCIB4406 is 100% active (as compared with the protexn 
from the native gene, against ECB but contains the substitution 
station in the third quarter of the synthetic gene at positxon 
X323 resulting in an amino acid substitution of a leucine for a 
phenylalanine. Plasmid P BS123#13 is used to repair thxs 
mutation • 

The third quarter fragment in plasmid P BS123#13 is made 
from an approximately 479 bp hybridized oligomer generated 
fragment. Third quarter oligomers 015-U2O and L15-L21 are 
fcinased, hybridized, and ligated as described above. PGR 
reactions are carried out as described above with primers P5 (a, 
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and P6(b> for 15 cycles. The PGR product is treated with 
proteinase K at a final concentration of about 50 pg\ml in an 
approximate 95 pi volume for 30 minutes at 37-C followed by 10 
minutes at 65-C (Crowe et al . , Nucleic Acid Research 19:184, 
1991.) Subsequently, the product is phenol\chloroform extracted 
and ethanol precipitated using standard procedures before 
cutting with restriction enzymes Apa I and Nco I. 

The approximate 450 bp Apa I\Nco I PCR fragment is 
ligated to the 3.8 Kb Apa I\Nco I vector fragment from P 1HG#6 
to make pBS123#13. Plasmid P BS123#13 contains the desired 
sequence for the third quarter of the maize optimized crylA(b) 
gene from position 1319 at the Nsp I site through the Apa I 
site at position 1493. This 170 bp Nsp I\Apa I fragment from 
pBS123#13 is used in the fully active synthetic cryIA<b) gene 
in plasmid pCIB4418. 

Western Blot Analysis: 

Western blot analyses of various transf ormants are 
performed using crude extracts obtained from coli grown on 
selective plates. Using a toothpick, cultures are scraped from 
fresh plates containing the transf ormants of interest which 
have been grown overnight at 37-C. The positive control for 
expression of the Bt gene in ^ coli was a construct called 
PCIB3069 which contains the native Bt-k gene fused with the 
plant expressible CaMV 35S promoter. P CIB3069 also contains 
the 35S promoter operably linked to the hygromycin resistance 
gene, 35S promoter, with Adh intron #1 operably linked to. the 
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GUS gene, an. 3SS promoter operably U« « • — — 
the production o£ the native Bt cryXACb, IP. A negate 
control of m eoli which does not contain a Bt gen. is aiso 
included in the analyses . Cultures are resuspended 
of loading buffer containing « mM Tris-HCl P H 6.8, 1* SDS, 
0 0025% bromophenol blue, 10* glycerol and 7.5% 
„erca P toethanol . After heating the mixtures at 95-c for 10 
minutes, the preparations are sonicated for 1-3 seconds. The 
debris is centrifuged in a microfuge at room temperature for 
about 5 minutes and 10 to 15 pi of each sample is loaded onto 
an acrylamide gel with a 10% running gel below a 6% steering 
gel <«emmli, 12^227; .80-685,1570,,. After electrophoresis 
overnight at 10 mAmps, proteins are transferred from the gel to 
an I mmobilon membrane .Mlllipore, . the transfer is done usxng 
an electrophoretic Blotting Unit (American BioHudear, 

,,«„. (2 n mM Tris, 150 mM glycine, 
Emeryville, CA> in transfer buffer (20 mM 

and 20% methanol) for 1.5 hours at 450 mAmps. 
Buffers for western blotting included: 
Blocking buffer: 2% Tween-20 

30 mM Tris-HCl pH 10.2 



150 mM NaCl 



Wash buffer: 0.05% Tween-20 

30 mM Tris-HCl pH 10.2 

150 mM NaCl 



Developing buffer: 100 mM Tris-HCl P H 9.6 
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After transfer is complete, the membrane is incubated 
for about ten minutes in the blocking buffer. Three 15 minute 
washes with wash buffer are done before the first antibody 
treatment. The first antibody is an immunoaf f inity purified 
rabbit or goat antibody prepared using the CrylA(b) protein as 
the antigen (Ciba-Geigy, RTP, N.C.; Rockland Inc., 
Gilbertsville, PA.; and Berkeley Antibody CO., Richmond, CA.). 
The crylA(b) specific antibody is treated immediately before 
use with E, coli lysate from Bio-Rad in a 1 ml volume with 5 yg 
of antibody, 50 pi of E. coli lysate in the wash buffer 
solution. This mixture is incubated for 1 hour at room 
temperature before diluting it 1 to 30 for a final dilution of 
1:6000 with wash buffer. Incubation of the membrane with the 
first antibody is at room temperature for 1.5 hours. 

Three 10 minute washes are done between the 1st and 2nd 
antibody treatments. The second antibody is either rabbit 
anti-goat or goat anti-rabbit/alkaline phosphatase conjugate 
(Sigma, St. Louis, MO.). Incubation with the alkaline 
phosphatase conjugate is carried out at room temperature for 
one hour using a 1 to 6000 dilution in wash buffer. Six 10 
minute washes are done between the second antibody 
treatment and developing the western blot . The western blot is 
developed in 100 ml of developing buffer with 440 ul of 
nitroblue tetrazolium in 70% dimethyl f ormamide (75 mg\ml) , and 
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, _ krte?nh ai-P in 100% dimethyl 
330 Ul of S-b.omo-4-chloro-indolyl-phosphate xn 

-FriT- 15 to 30 minutes, 
formamide (50 mg\ml> . after developing for 

the membrane is washed in water end air dried. 

Baiffilg 4 ; coy ^nnTTOM OF TFftNSFOFMATTON VECTORS 

construction of pCIB710 and derivatives. 

CaMV 35S Promoter cassette Plasmids PCIB709 and PCIB710 
«e oonstructed as shown in Rothstein et .1.. Gene 53 : 153-161 
(198 7, pCIBTIO contains CaMV promoter and transcriptron 
termination seuuences for the 35S » transcript tcovev et al., 
Hacj aMds.Res. . 9:6735-6747 ,1961).. A bp BglH 

restriction fragment of CaMV DNA (hp 6494-7643 in Hohn et al., 

-- r <- XL* tv - — " n "- 22 ° and 

Appendices A to <= (1982) ] is isolated from CaMV DMA by 
preparative agarose gel electrophoresis as described earlrer 
The fragment is mired with BamHI-deaved plasmid P UC19 DMA, 
treated with T4 DNA ligase. and transformed into ^ coir. 
.Note the BamH! restriction site in the resulting Plasmid is 
destroyed by ligation of the Bglll cohesive ends to the BamHl 

cohesive ends.) 

The resulting plasmid, called p 0 C19/35S. is then used 

in oligonucleotide-directed in-vitro mutagenesis to insert the 
BamHI recognition seguence GGATCC immediately following CaMV 
nucleotide 74S3 in the Hohn reference. The resulting Plasmid, 
PCIB710, contains the CaMV 35S promoter region and 
transcription termination region separated by a BamHI 
restriction site. DNA sequences inserted into this BamHI site 
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will be expressed in plants by these CaMV transcription 
regulation sequences. (Also note that P CIB710 does not contain 
any ATG translation initiation codons between the start of 
transcription and the BamHI site) . 

PCIB710 is modified to produce P CIB709 by inserting a 
Bam HI fragment containing the coding sequence for hygromycin 
phosphotransferase from P LG90 [Rothstein et al.. Gene, 
53:153-161 (1987)] in the Bam HI site. 

PCIB709 is modified to produce P CIB996 by removing the 
ATG just upstream from the initiation codon of the hygromycin 
phosphotranserase gene using standard mutagenesis techniques 
while inserting a Bgl II restriction site at this location. 
The resulting plasmid, P CIB996, is further modified to remove 
the Bam HI, Sma I and Bgl II sites in the 5' untranslated 
leader region located 5' of the initiation codon for the 
initiation codon. The result is a change of DMA base sequence 
from -TATAAGGATC CCGGGGGCA AGATCTGAGA TATG-Hyg to - T AT AAGGAT C 
TGAGATATG-Hyg . The resulting plasmid is known as P CIB3073. 

Alternatively, P CIB710 is modified to produce P CIB900, 
by inserting the Bam HI - Bel I fragment of P CIB10/35SBt, which 
contains the 645 amino acid Bt coding sequence, described in 
Part C4 below, into the Bam HI site of P CIB710 to create 
P CIB710/35SBt. To introduce an antibiotic resistance marker, 
PCIB709 is cut with Sal I, a Kpn I/Sal I adaptor is ligated and 
the resulting ligation product is cut with Kpn I. The Kpn 
fragment of P CIB709 containing the 35S/hygromycin resistance 
gene is inserted into the Kpn I site of pCIB710/35SBt to 
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produce PCIB900. 

cenes useful as toe select marxer gene include the 

hy,romyein MM » S*„e " ^ 

53 . 153-161 (1997) . The hygromycin gen. described in this 
r efer.nce is meed into a pUC plasmid such as PCIB710 cr 
P CIB709 and the -extra" ATG upstream fro. the hygromycin 
phosphotransferase ceding sequence is removed to create 
PCIB996. This modified PCIB996 gene is further modified to 
rem ove a BglH. BamHI and Sma! sites from the S- region of the 

m h .pchnioues of molecular biology to make 
gene using standard techniques 

pCIB3073. 

pCIB932 is a pOC19-bas.d plesmid containing the 
chimeric gene Pep-C:promot.r\Bt\Pep-c:terminacor. It is 

, opep-10. a Hindlll subclone 

composed of frag«*nts derived from pPEP 10, 

of a genomic clone. Hl-lambda-K, PKAS.0SA. »3=29 9 4- 2 8 88 

. of the maise gen. encoding the PEP carboxylase ensyme 
active in photosynthesis, and from P=IB930. which is a BamHI 
fragment containing the 645 amino acid truncated form of the 
the crylAn endotoxin gene in the BamH! site of pOC18 . 

2.6 xb BcoRI-XhoI fragment from pPEP-10. containing 
the polyA addition site from the PBP carboxylase gene, is 
isolated and digested with Pstx and Hindi. The restriction 
dig est is ligated with Pstl/HincI! digested PUCIS. transformed 

E^. coll. and transf ormsnts screened for those * 
— . *. verified r>y 



412 bp Pstl-Hindl insert in P UC18 and the i 
sequencing. The resulting plastnid is called P CIB931. 

The nuclear gene encoding the phosphoenolpyruvate 
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carboxylase isozyme <"Pep-C") is described in Hudspeth et al., 
Plant Molecular Biology , 12: 579-589 (1989). P CIB932 is 
constructed by the ligation of three fragments. The first 
fragment, containing the PEP-C transcription terminator, is 
produced by digesting pCIB931 to completion with Hindlll, 
partially with Sphl and the 3098 bp fragment isolated. The 
second fragment, containing the Bt endotoxin coding sequence, 
is produced by digesting pCIB930 with Ncol and Sphl and 
isolating the 1950 bp fragment. The third fragment, containing 
the PEP-C promoter, is produced by digesting pPEP-10 to 
completion with Hindlll, partially with Ncol and isolating the 
2.3 Jcb fragment. The ligation mix is transformed into E^. coli, 
transformants with the correct insertion identified and the 

insert verified by sequencing. 

pCIB932 is cut with PvuII to generate a 4.9 Kb fragment 
containing the maize Pep-C:promoter\Bt\Pep-C: terminator and 
purified on a 1% LGT agarose gel in IX TAE. The linearized 
PCIB3079 vector and the 4.9 Kb insert from pCIB932 are ligated 
using T4 DNA ligase in LGT to make pCIB4401. pCIB4401 is a 
maize transformation vector containing the chimeric genes: 
35S :promoter\PAT\35S : terminator , Pep-C : promoter \Bt\Pep-C : 
terminator, and 35 S :promoter \AdhI #1 intron\GUS\35S : 
terminator. 

Construction of pCIB246 (35S-GUS-35S) 

A CaMV 35S promoter cassette, P CIB246, is constructed 

as follows. 

The Ddel restriction site at nucleotide position 7482 
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TFranc* et al.. Cell, 21:285-294 (1980)] is 
of the CaMV genome [Franc* ec ax 

edified by insertion of a «. bp oligonucleotide 

■ - ncdi iccat :!' 

site, a Sail (GTCGAC) site, and an SstI (««C) ««" ' 
altered 35S promoter is inserted into a puCI, vector that 

„ad been modified to destroy the vector's SstI and sal! arte.. 
ms , the caMV 353 promoter or PCIB1500 contains unigue SstI 

and Sail sites for cloning. 

PCIB1S00 is digested with Satl/Hcol and ligated wrth 
the GuS gene obtained from P BI221 (Clontech Laboratories. Inc.. 

that the ATG of the Ncol site functions as the start codon for 
the translation of the GUS gene. The CaMV 3SS polyadenylatron 

» ^-h- 3» end of the chimeric 

and termination signals are used for the 3 

gene. 

Construction of P CIB3069 (35S-Adhl-GUS-35S) 

PCIB246 is modified by adding the maize alcohol 

ah>,i intron number 1 (Adhl) (Dennis et al., 
dehydrogenase gene Adhl intron numo 

„>, 17-3983-4000 (1984)) into the Sal I 
H»r1eic Ar ^" Research, 12.39BJ 

ait. of PCIB246 to produce plasmid PCIB3007. The Adhl intron 
is excised from the maize Adhl gene as a Bal I/Pst I fragment 
and suhcloned into P.C18 that was cut with Sma I/Pst I to ^ 
a plasmid called Adh 1026. Adh 102 6 is cut with Pvu 
the fragments are mad. blunt ended with T4 OKA polymerase, al 
X liters are added using standard procedures and a fragment 

« - 3 % NuSeive gel and ligated 

about 560 bp is recovered from a 3 * wuse 

_ m ^Tir-iP The Sal I linkered 
into Sal I cut/phosphatase treated P UC18 . The 
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Adh intron #1 in the resulting plasmid is cut out with Sal I, 
gel purified, and ligated into Sal I cut/ phosphatase treated 
pCIB246 to make plasmid pCIB3007. 

PCIB3007 is cut with PstI and the ends made blunt by 
using T4 DNA polymerase (NEW England Biolabs) according to the 
suppliers' specifications. The resulting blunt ended molecules 
are cut with Sph I and the approximately 5 . 8 Kb fragment with 
one blunt end and one Sph I end is purified on a low gelling 
temperature (LGT) agarose gel using standard procedures. 
PCIB900 is cut with Sma I/Sph I and the fragment containing the 
35S/Bt gene is purified on a LGT agarose gel. The two gel 
purified fragments are ligated in LGT agarose using T4 DNA 
ligase according to standard conditions. The resulting ligated 
fragments are transformed into Ej. coli using standard 
procedures and the resulting plasmid is called P CIB3062. There 
are two versions of P CIB3062. P CIB3062#1 has a Sma I site 
regenerated where the Sma I site and the T4 polymerase blunted 
ends are ligated. This most likely results from the T4 
polymerase nibbling a few base pairs from the Pst I site during 
the blunting reaction. P CIB3062#3 does not have this Smal 
site. 

pCIB3062#3 is cut with Kpnl and made blunt-ended using 
T4 DNA polymerase, and subsequently cut with Pvu II to yield a 
6 . 4 Kb fragment with blunt ends containing the 35S/GOS and 
35S/Bt genes. This blunt-end fragment is ligated into Sma I 
cut PCIB3073 to produce pCIB3063 or pCIB3069. P CIB3069 
contains the same fragment used to make P CIB3063, but the 
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^ic genes in PCXB306* - all in the - relative 
orientation, unlire those in PCXB3063. These plaids contain 
a , a 35S prefer operably linaed to toe hygromycin resistance 

W a 353 Procter. »ith Adh intron .1. operably linked to 
th e GUS gene.- and c, a 35S propter operably linked to a gene 
coding for the production of the synthetic cry!**) 
insecticidel protein fro. Bacillus t hnHngiensis ,, as described 

above. 

GUS Assays: 

GUS assays are done essentially as 
.efferson, «»- 5 = 387-405 (196'') As 

«r T R2 4 6 contains a CaMV 35S promoter fused 
shown above, plasmid pCIB246 contains 

with the GUS gene. The 5< untranslated leader of this chre.ric 
g ene contains a copy of the mai,e Adhl intron .1. Xt is used 
here as a transforation control . Although the same amount of 
PCIB246 is added to each transformation, the calculated 
activity varied among Bt constructs tested. The values 
.sported below are averages of 3 replicates. PCXB4407 -as 



tested, twice 



PCIB3069 28 nM MU/ug/min 

pCIB4407 



0.7 nM MU/ug/min, 2.3 nM MU/ug/min 



EXAMPLE -^-S^M" IHSECTICIDS1 

The synthetic cryXA(b) gene in P CIB4414 in salt-" 
assayed for insecticidel activity against European corn borer 
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according to the following protocol. 

Molten artifical insect diet is poured into a 60 nun 
Gellman snap-cap petri dish. After solidification, E. coli 
cells, suspended in 0.1% Triton X-100, are spread over the 
surface at a concentration of 3 X 107 cells/cm2. The plates 
are air dried. Ten first instar European corn borer, pstrinia 
nubilalis , which are less than 12 hours old are then placed 
onto the diet surface. The test is incubated at 30 C in 
complete darkness for 2-5 days. At the end of the test percent 
mortality is recorded. A positive clone has been defined as 
one giving 50% or higher mortality when control E, coli cells 
give 0-10% background mortality. 

For comparison, the native crylA(b) gene in P CIB3069 is 
tested at the same concentration. Clones are tested at 3 x 10 
cells/cm 2 diet; 20 insects per clone. 

The following results are observed: 

clone Percent Mortality 

Control , ° 

pCIB3069 "J 
pCIB4414 100 

These results indicate that the insecticidal crystal 
protein produced by the synthetic crylA(b) gene demonstrates 
activity against European corn borer comparable to that of the 
IP produced by the native crylA(b) . Other plasmids containing 
a synthetic crylA(b) gene were assayed in a similar manner. 

EXAMPLE 5AB : ASSAY OF CRYIA(b) PROTEIN FOR INSECTICIDAL 
ACTIVITY AGAINST SUGARCANE BORER. 

CrylA(b) was expressed in ^ coli and assayed for 
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insecticidal activity against Sugarcane borer (Diatrea 

-4- rr + n the same protocol used for European 
ca ^haralis ) according to the same pro 

corn borer, described immediately above. The results are 
summarized in the Table. 

TABLE 

- ee . v wtth Bt PROTEIN FROM E^ COL I 
SUGARCANE BORER ASSAY WITH bc rnu " 

_ „ Percent 



10 
25 
50 
100 
250 
500 
1000 

LC50 



0 
0 
7 
13 
40 
53 
80 

380 



95% CI 2 49 " 646 



The results indicate that the insecticidal protein 
produced by a maize optimized Bt gene is e«| c ~ A ( *, pro tein, 
lugarcane borer . ™ jWgJSXS 5 ^ransgeni/maize plants 
^ffJcSJS-S'^ instant -invention. 

EXAMPLE 6: MAIZE PROTOPLAST ISOLATION AND TRANSFORMATION WITH 
EXAM * THR SYNTHF-TTP- BT GENE 

Expression of the synthetic Bt gene is assayed in 
transiently transformed maize protoplasts . 

Protoplast isolation Procedure: 

1. The contents of 10 two day old maize 2717 Line 6 
suspension cultures are pipetted into 50 ml sterile tubes and 
allowed to settle. All culture media is then removed and 



discarded. 
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2. Cells (3-5 ml Packed Cell Volume) are resuspended 
in 30 ml protoplast enzyme solution. Recipe follows: 
3% Cellulase RS 

1% Macerozyme RIO in KMC Buffer 
KMC Buffer (recipe for 1 liter) 
KC1 8.65 g 

MgCl 2 -6H 2 0 16.47 g 

CaCl 2 -2H 2 0 12.50 g 

MES 5.0 g 

pH 5.6, filter sterilize 

3. Mix cells well and aliquot into 100x25 mm petri 
dishes, about 15 ml per plate. Shake on a gyratory shaker for 4 

hours to digest. 

4. Pipette 10 ml KMC through each 100 micron sieve to 
be used. Filter contents of dishes through sieve. Wash sieve 

with an equal volume KMC. 

5. Pipette sieved protoplasts carefully into 50 ml 
tubes and spin in a Beckman TJ-6 centrifuge for 10 minutes at 

* 

1000 rpm (500 x g) . 

6. Remove supernatant and resuspend pellet carefully in 
10 ml KMC. Combine contents of 3 tubes into one and bring 

volume to 50 ml with KMC. 

7. Spin and wash again by repeating the above step. 

8. Resuspend all washed protoplasts in 50 ml KMC. 
Count in a hemocytometer . Spin protoplasts and resuspend at 8 
x 10 6 /ml in resuspending buffer (RS Buffer) . 

RS Buffer (recipe for 500 ml) 
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mannitol 27.33 g 

CaCl 2 (0.1 M stock) 75 ml 

MES 0.5 g 

pH 5.8, filter sterilize 
Protoplast Transformation Procedure: 

1. Aliquot 50 ug Plasmid DNA (Bt IP constructs, both 
synthetic < P CIB4407> and native ( P CIB3069) ) to 15 ml 
poly st y rene culture tubes. Also aliquot 25 ug GUS-contain.ng 
plasmid DHA (which does not contain Bt IP <PCIB246> to all 
tubes. 3 replications are used per construct to be tested, 

with 1 rep containing no DNA as a control. 

rc us construct; 
Bt constructs: — 

PCIB246 

pCIB3069 * 
pCIB4407 

2. Gently 1. protoplasts -ell and aliquot 0.5 ml per 

tube. 

3. Add 0.5 ml PEG-40 to each tube. 

PEG-40: 

0.4 M mannitol 

0.1 M Ca(N0 3 .) 2 -4H 2 0 

pH 8.0, filter sterilize 

4. Mix gently to combine protoplasts with PEG. Wait 30 

, „i -? ml and 5 ml W5 solution at 

5. sequentially add 1 ml, 2 ml, ana => 



minutes 



5 minute intervals 
W5 Solution: 
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154 mM NaCl 

125 mM CaCl 2 -H 2 0 

5 mM KC1 

5 mM glucose 
pH 7.0, filter sterilize 

6. Spin for 10 minutes in a Beckman TJ-6 centrifuge at 
about 1000 rpm (500g) . Remove supernatant. 

7. Gently ^resuspend pellet in 1.5 ml FW media and plate 

«. 

carefully in 35x10 mm petri dishes. 
FW media (recipe for 1 liter) : 

MS salts 4.3 g 

200X B5 vits. 5 ml 

sucrose 30 g 

proline 1.5 g 

mannitol 54 g 

2,4 D 3 mg 

pH 5.7, filter sterilize 

8. incubate overnight in the dark at room temperature. 

9. Perform GOS assays, insect bioassays, and ELISA's on 
protoplast extracts as described below. 

EXAMPLE 7: CONSTRUCTION OF A FULL-LENGTH SYNTHETIC MAIZE 

. OPTIMIZED raYIA(b) GENE 

Sequence 4 shows the synthetic maize optimized sequence 
encoding the full-length crylA(b) insecticidal protein from B, 
1-hnrinaiensis . The truncated version described above 
represents the first approximately 2 Kb of this gene. The 
remainder of the full-length gene is cloned using the 
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proc e d ures a— lb- ^cve. Briefly, this P«— «• 
synthesizing M .U,« S of ,0 ,c SO „ in length, typically 

• The oligomers are purified 

using 80 mers as an average size. The olig 

using standard procedures of HPLC or recovery from a 
polyaorylamide gel . Purified oligomers are Unas- and 
hybridised to for. fragments of about 500 bp. The hybridised 
oligomers can be amplified usin, PCR under standard conditions. 
The 500 bp f regents, either directly from hybridizations, from 
PCR amplification, or recovered from agarose gels after either 
hybridization or PCR amplification, are then cloned into a 

, u roll using standard procedures, 

plasmid and transformed into L coll using 

# . . nrT desired inserts are 

Recombinant plasmids containing the desire 

I as described above, using PCR and/or standard 
_ procedures . Inserts that appear correct based upon 
Chair PCR and/or restriction enzyme profile are then sconced 
to identify those clones containing the desired open reading 
£ta .e. Tbe f ragments are then ligated together with the 
approximately 2 Kb synthetic serene, described in Example to 
produce a full-length maize optimized synthetic cryiMb, 
ge ne useful for expression of high levels of crylMb, Protein 
in maize. 

G+ C Content of native and synthetic Bt genes : 

38 8 % 

Full-length native 

37 2% 

Truncated native 

64 8% 

Full-length synthetic 

64.6% 

Truncated synthetic 
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% homology of the final truncated version of the Bt gene 
relative to a "pure" maize codon usage gene: 98.25% 
EXAMPLE 8 Construction of a plant expressible, full-length, 
hybrid partially maize optimized crylA(b) gene. 

pCIB4434 contains a full length CrylA(b) gene 
comprised of about 2 Kb of the synthetic maize optimized 
crylA(b) gene with the remainder (COOH terminal encoding 
portion) of the gene derived from the native gene. Thus, the 
coding region is a chimera between the synthetic gene and the 
native gene, but the resulting protein is identical to the 
native crylA(b) protein. The synthetic region is from 
nucleotide 1-1938 (amino acids 1 to 646) and the native coding 
sequence is from nucleotide 1939-3468 (amino acids 647 to 
1155) . The sequence of this gene is set forth in Fig. 7. A 
map of pCIB4434 is shown in Fig. 8. 

The following oligos were designed to make pClB4434 : 

KE134A28 = 5 ' -CGTGACCGAC TACCACATCG ATCAAGTATC CAATTTAGTT 
GAGT-3' 

KE135A28 - 5' -ACTCAACTAA ATTGGATACT TGATCGATGT GGTAGTCGGTC 
ACG- 3 ' 

KE136A28 = 5 ' -GCAGATCTGA GCTCTTAGGT ACCCAATAGC GTAACGT-3' 
KE137A28 = 5' -GCTGATTATG CATCAGCCTAT - 3 ' 
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rr-rrTTATTC CTCCATAAGA AGTAATTC-3' 
KE138A28 = 5 ' -GCAGATCTGA GCTCTTATTu 

MK05A28 = 5 ' — CAAAGGTACC CAATAGCGTA ACG-3' 
MK35A28 = 5' -AACGAGGTGT ACATCGACCG-3' 

PCIB4434 is made using a four-way ligation with a 5.7 
kb fragment from P CIB4418, a 346 bp Est B II\K P n I 
PCR-generated synthetic -.native fusion fragment, a 108 b P K P n 
I\Nsi I native CrylA(b) fragment from pC»1315, and a 224 bp 
Nsi I\Bgl II PCR-generated fragment . Standard conditions for 
ligation and transformation are as described previously. 

A synthetic mative gene fusion fragment is made xn two 

• n prp The first 253 bp of the PCR fusion fragment 
steps using PCR. Tne tirst. r 

is made using 100 pmols of oligos KE134A28 and MK04A28 with 
approximately 200 ng =f native cryIMb) "opiate in a 100 ul 
volume with 200 n. of eaoh CHIP, 1 X PCR buffer (Perxin Elmer 
cetus, , 20 % glycerol, and 5 units of Tag polymerase (Perxin 
Elmer Cetus, . The PCR reaction is run with the following 
p«ameters: 1 minute at 94-0. 1 minute at 55-C, 45 seconds at 
72 .c. with extension 3 for 3 seconds for 25 cycles. A fraction 
(1 %) of this first PCR reaction is used as a template along 
with 200 ng of the synthetic crylA(h) OH* to make the complete 
351 bp synthetic: native fusion f ragment. Oligos used as PCR 
primers in this second PCR reaction are 50 pmols of XK35A28, 50 
pmols of «K0«28. and 25 pmols of KB135A28. The PCR reaction 
mix and parameters are the same as those listed above. The 
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resultant 351 bp synthetic : native fusion fragment is treated 
with Proteinase K at 50 ug\ml total concentration and 
P henol\chloroform extraction followed by ethanol precipitation 
before cutting with Bst E II\Kpn I using standard conditions. 

The 224 bp Nsi I\Bgl II PCR fragment used in making 
P CIB4434 is made using 100 pmols of oligos KE137A28 and 
KE138A28 and 200 ng of the native crylA(b) gene as template in 
100 ul volume with the same PCR reaction mix and parameters as 
listed above. The 230 bp PCR native crylA(b) fragment is 
treated with Proteinase K, phenol\chlorof orm extracted, and 
ethanol precipitated as described above, before cutting with 

Nsi I\Bgl II. 

PCIB4434 was transformed into maize protoplasts as 

described above. Line 6 2717 protoplasts were used with 
PCIB4434 and pCIB4419 as a control for comparison. The results 
are shown below: 



ng Bt/mg protein 



4419<35S) 14,400 * 2,100 



4434 (full-length) 2,200 ± 900 



Background = 13 ng Bt/mg protein for untransf ormed protoplasts 



The results indicate that pCIB4434 expresses at a 
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of about 15% of PCIB4419- 

Western blot analysis shows at least one-third of 
cryIA <h> Protein produce* hy PCIB4434 in this syste. is ahout 
130 *0 in size. Therefore, a significant amount of full-length 
cryIA(b , Protein is produced in .aize cells fro. the expressxon 
of pCIB4434. 

EXAMPLE 7. construction of a full-length. crylA.b, genes 

encoding a temperature-stable crylA.b) protein. 

Constructs pCIBSSll-5515, each containing a 
full-length. crytA.b, gene are described below. In these 
seguences. the 26 amino acid deletion between amino adds 7.3 

and 794, KCGEPHRCAPHLEHHPDLDCSCFDSE, present in crylAla) an 
crylA.o. but not in cryiAO,,. ha, been repaired. The gene rn 
pC IB5 513 is synthetic- the other four genes are hybrids, and 

thus are partially maize optimized. 

Construction of pCIBSSll 
This plasmid is a derivative of P CIB4434. A map of 
pCI B5511 is shown in Elg. 10. A 435 bp segment of OKA between 
bp 2165 and 2530 was constructed by hybridization of synthetic 
oli gomers designed to represent the upper and lower strand as 
described above for the construction of the truncated crylA <b, 
,ene. This segment of synthetic DNA is synthesized using 
standard technics Xnown in the art and includes the 26 ammo 
aci d deletion found to occur naturally in the crylA ( b, protein 
ta Bacillus ^inelensis Kurstari HD-1 • The entire inserted 
segment of DHA uses maize optimized codon preferences to encode 
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occurring deletion are contained within this fragment. They are 
inserted starting at position 2387 between the Kpnl site at nt 
2170 and the Xbal site at nt 2508 (2586 in pCIB5511) of 
pCIB4434. pCIB5511 is constructed via a three way ligation 
using a 3.2 Kb fragment obtained by restriction digestion of 
PCIB4434 with SphI and Kpnl, a 3.8 Kb fragment obtained by 
digestion of P CIB4434 with SphI and Xbal, and a 416 bp fragment 
obtained by digestion of the synthetic DNA described above, 
with Kpnl and Xbal. Enzymatic reactions are carried out under 
standard conditions. After ligation, the DNA mixture is 
transformed into competent coli cells using standard 
procedures. Transf ormants are selected on L-agar containing 100 
ug/ml ampicillin. Plasmids in transf ormants are characterized 
using standard mini-screen procedures. The sequence of the 

crylA(b) gene encoding the cryIA<b) temperature (heat) 

stable protein is set forth in Fig. 9. 

Construction of pCIB5512 
This plasmid construct is a derivative of pCIB4434 . 
A map of pCIB5512 is shown in Fig. 12. DNA to repair the 26 
amino acid deletion is prepared using standard techniques of 
DNA synthesis and enzymatic reaction. Three double stranded DNA 
cassettes, pGFcasl, pGFcas2 and P GFcas3, each about 300 bp in 
size, are prepared. These cassettes are designed to contain the 
maize optimized codons while maintaining 100% amino acid 
identity with the insecticidal protein. These cassettes are 
used to replace the region between restriction site BstEII at 
position 1824 and Xbal at position 2508 and include the 
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insertion or the additional 78 bp which encode the missing 26 
amino acids (described above for pCIBSSll in P CIB4434) . Each of 
^ese cassettes is cloned into the EcoRV site of the vector 
Bluescript (Stratagene) by standard techniques. The three 
cassettes are designed to contain overlapping restriction 
sites. Cassette 1 has restriction sites BstEII at the S< end 
and EcoRV at the 3' end: cassette 2 has EcoRV at the 5' end and 
Clal at the 3' end and cassette 3 has Clal at the 5' end and 
Xba I at the 3' end. They are cloned individually in Bluescript 
and the the complete 7 62 bp fragment is subsequently assembled 
by ligation using standard techniques. P CIB5512 is assembled 
using this 762 bp fragment and ligating it with a 6.65 Kb 
fragment obtained by a complete digestion of P CIB4434 with 
BstEII and a partial digestion with Xbal . Alternatively, a four 
way ligation using the same vector and the three cassettes 
digested with the specific enzymes can be employed. Enzymatic 
reactions are carried out under standard conditions . After 
ligation, the DNA mixture is transformed into competent E, colx 
cells using standard procedures. Transf ormants are selected on 
L-agar containing 100 ug/ml ampicillin. Plasmids in 
transformants are characterized using standard mini-screen 
procedures. The resulting plasmid is P CIB5512. The sequence 
of the repaired crylA(b) gene is illustrated in Fig. 11. This 
repaired crylA(b) differs from that carried in P CIB5511 in that 
a larger region of the crylA(b) coding region is optimized for 
maize expression by using maize preferred codons. 

Construction of pCIB5513 
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This plasmid contains a repaired crylA(b) gene derived 
from pCIB5512. A map of P CIB5513 is shown in Fig. 14. The 
region 3' from the Xbal site at position 2586 to the end of the 
gene (Bglll site at position 3572) is replaced entirely with 
maize optimized codons. This region is synthesized, using 
standard techniques of DNA synthesis and enzymatic reaction, 
well known in the art, as four double stranded DNA cassettes 
(cassettes # 4,5 ,6 ,7). Adjacent cassettes have overlapping 
restriction sites to facilitate assembly between cassettes. 
These are Xbal and Xhol at the 5' and 3' ends of cassette 4; 
Xhol and Sad at the 5' and 3' ends, respectively, of cassette 
5; Sad and BstXI at the 5' and 3' ends, respectively, of 
cassette 6; and BstXI and Bglll at the 5' and 3' ends, 
respectively, of cassette 7. As described for pCIB5512, the 
cassettes are cloned into the blunt-end EcoRV site of the 
Bluescript vector (Stratagene) and the full-length "repaired" 
crylA(b) gene cloned either by sequential assembly of the above 
cassettes in Bluescript followed by ligation of the complete 
967 bp synthetic region with a 6448 bp fragment obtained by a 
complete digestion of pCIB5512 with Bglll and a partial 
digestion with Xbal. Alternately, the plasmid containing the 
full-length genes is obtained by a 5-way ligation of each of 
the four cassettes (after cleavage with the appropriate 
enzymes) and the same vector as above. The sequence of the 
full-length, "repaired" cry IA(b) gene is set forth in Fig. 13. 
The protein encoded by the various synthetic and 
synthetic/native coding region chimeras encode the same 
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protein. This protein is the b-t^utt. version of cryXA.b, 
produced by repairing the naturally occurring 26 amino acid 
deletion found in the cryXACb, gene from Bacillus thuringiensis 
kurstaki HD-1 when the homologous region is compared with 
either =r,IA(a) or crylA(c) Bacillus rhurlngiensis . 
delta-endotoxins . 

Construction of pCIB5514 
This plasmid is a derivative of P CIB4434. A map of 
PCIB5514 is shown in Fig. 16. It is made using synthetic DNA 
cassette #3 (see above) which contains a maize optimized 
sequence of the region between the Clal site (position 2396) 
found in the 26 amino acid thermostable region and the Xbal 
site at position 2508 in P CIB4434 (2586 in pCXBSSll) . The 
region between nt 2113 of P CIB4434 and the junction of the 
thermostable region is PGR amplified by using P CIB4434 as 
template with the following primers : 

forward: 5 ' GCACCGATATCACCATCCAAGGAGGCGATGACGTATTCAAAG-3 ' 
reverse: 

5' -AGCGCATCGATTCGGCTCCCCGCACTTGCCGATTGGACTTGGGGCTGAAAG - 3 ' . 

The PGR product is then digested with restriction 
enzymes Kpnl and Clal and ligated in a four part reaction with 
a 189 bp fragment obtained by digestion of cassette 3 with Clal 
and Xbal, a 3.2 Kb fragment of P CIB4434 digested with Sphl and 
Kpnl, and a 3.8 Kb fragment of P CIB4434 obtained by digestion 
with Sphl and Xba. Enzymatic reactions are carried out under 
standard conditions. The ligation product is transformed into 
competent E coll cells, selected with ampicillin and screened 
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using standard procedures described above. The sequence of the 
repaired crylA(b) gene contained in P CIB5514 is shown in Fig. 
15. 

Construction of pCIB5515 
pCIB4434 was modified by adding the 78bp Geiser 
thermostable element (Geiser TSE) , described above, between 
the Kpn I site {2170 bp) and the Xba I site (2508 bp) in the 
native Btk region. The exact insertion site starts at the 
nucleotide #2379. The region containing the Geiser TSE was 
amplified by two sets of PCR reactions, i.e. the Kpn I - Geiser 
TSE fragment and the Geiser TSE - Xba I fragment. 



PCR primer#l: (Kpn I site) 

5' - ATTACGTTAC GCTATTGGGT ACCTTTGATG - 3' 



PCR primer#2: (Geiser TSE bottom) 

5' - TCCCCGTCCC TGCAGCTGCA GTCTAGGTCC GGGTTCCACT 
CCAGGTGCGG AGCGCATCGA TTCGGCTCCC CGCACTTGCC 
GATTGGACTT GGGGCTGA - 3' 



PCR primer#3: (Geiser TSE top) 

5' - CAAGTGCGGG GAGCCGAATC GATGCGCTCC GCACCTGGAG 
TGGAACCCGG ACCTAGACTG CAGCTGCAGG GACGGGGAAA 
AATGTGCCCA TCATTCCC - 3' 



PCR primer#4: (Xba 1 site) 

5' - TGGTTTCTCT TCGAGAAATT CTAGATTTCC - 3' 
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After the amplification, the PC* fragments »ere 
digested with (Kpn I + CI. II »«» (Cl« + Xba I> , 
t espectively. These two fragments were iigated to the Kpn ! 
and Xba I digested PCXB4434. The resulting construct P CIB551S 
is P CIB4434 with a Geiser TSB and an extra Cla 1 site flanged 
by Kpn I and xba I. A nap of pCiBoSIS is illustrated in Fig. 
38. The crytACb, gene contained herein, which encodes a 
temperature stable crylA.b) protein, is shown in Fig. 37. 

Examples 9-20 set forth below are directed to the 
isolation and characterization of a pith-preferred promoter. 
EXAMPtE 9 . BNA Isolation and Northern Blots 

MX RHA was isolated from plants grown under greenhouse 

, , Hoseribed in Kramer et 
conditions. Total RNA was isolated as descrioe 

ok -„i QQ.1214-1220 (1990) from the following 
al., p1ant Phvsxol., 90-lzii 

,i i* 9«5 35. 40, and 60 
tissues of Fun* maize line SH984: 8, 11, 15, 25, 35, 

day old green leaves; 8. 11, 15. 25, 35, 39, 46, 60 and 70 day 
old Pith, 60 and 70 day old brace roots from Fun* maize line 
5H9 84.. 60 and 70 day 5H984 sheath and ear stoox. RHA -as also 
isolated from 14 day 2110 roots and from developing seed at 
weexly intervals for weexs one through five post-pollenation. 
Poly A + RHA was isolated using oligo-dT as described by 

ri».l«.. A Xal '•■'"•y Manual (2nd 

Sambrook et ml., Holecular cloning. A nan 

ed 1 . 1989, and northern blots were carried out, also as per 
Sambroo* et al. using either total RHA <30 ug» or poly A + RHA 
(2 . 10 „,) . After electrophoresis, BHA was blotted onto 
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Nitroplus 2000 membranes (Micron Separations Inc) . The RNA was 
linked to the filter using the stratalinker (Stratagene) at 0.2 
mJoules. The northerns were probed with the 1200 bp EcoRI pith 

(TRpA) 8-2 cDNA fragment, isolated by using 0.8% low melting 
temperature agarose in a TBE buffer system. Northerns were 

hybridized and washed and the filters exposed to film as 

described in Isolation of cDNA clones. 

EXAMPLE 10. Isolation of cDNA Clones 

First strand cDNA synthesis was carried out using the 
BRL AMV reverse transcriptase system I using conditions 
specified by the supplier (Life Technologies, Inc., 
Gaithersburg, MD) . Specifically, 25 ul reactions containing 50 

Tris-HCl P H 8.3, 20 mM KC1, 1 mM DTT, 6 mM MgC12, 1 mM each 
of each dNTP, 0.1 mM oligo (dT) 12-18, 2 ug pith poly (A + ) RNA, 
100 ug/ml BSA, 50 ug/ml actinomycin D, 8 units placental RNase 
inhibitor, 1 ul (10 mM Ci/ml) 32P dCTP >3000 mCi/mM as tracer, 
and 30 units AMV reverse transcriptase were incubated at 42 «C 
for 30 min. Additional KC1 was added to a concentration of 50 
mM and incubation continued a further 30 min. at 42-C. KC1 was 
added again to yield a final concentration of 100 mM. 
Additional AMV reverse transcriptase reaction buffer was added 
to maintain starting concentrations of the other components 
plus an additional 10 units, and the incubation continued at 
42-C for another 30 min. Second strand synthesis was completed 
using the Riboclone cDNA synthesis system with Eco RI linkers 
(Promega, Madison, WI) . Double stranded cDNA was sized on an 
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« agarose gel using ^orate-KDT* huffer as disclosed » 

s^oo* e t ax., and showed an average ot " 

, „.:.-, MA45 DEAE membrane so as 
The cDNA was size fractionated using NA45 u 

. ^ mftlpcules of about 1000 bp or larger using 
to retain those molecules or wou 

conditions specified b y the suppiier (Schleicher and Schuell ^ 
Size fractionated cDNA was ligated into the Lambda ZapII vector 

t a Jolla CA) and packaged into lambda particles 
(Stratagene, La Jolla, caj ^ * 

using Gigapac* I! Plus .stratagene, L a J0 XX». CM . The 
unified Xibrary nad a titer of 3XS.OO0 pfu whiXe the 
a^Xified Xihrar, nad a titer of 3.5 niliion/ml usxng 

CS11S ' Hainan* Phage were plated at a density of 5000 pfu 

» 4-«+-*i of 50,000 phage were 
on ISO X 15mm L-agar plates. A total of 5U,u 

scr eened using duplicate lifts from each plate and probes of 
first strand cDNA generated from either pith derived mRNA or 

n -F+- o were done as described in 
seed derived mRNA. The lifts were done 

f uters DNA was fixed to 
Sambrook et al. using nitrocellulose filters. 

the filters by UV crosslinking using a Stratalinker 

t* Jolla CA) at 0.2 mJoule. Prehybridization and 
(Stratagene, La Jolla, , . nn of 

-a^-iort out in a solution o£ 
hybridization of the filter were carried out 

10 X Denhardts solution, 150 sheared salmon sperm DNA, 

SDS 50 mM sodium phosphate pH 7, 5 mM EDTA, 6X SSC, 0.05% 

>. Prehybridization was at 62-C for 4 hours 

sodium pyrophosphate . Prehynriai 

hyhridiretion was at 62-C tor 1. hours (overnight, wrth 
n iXXion CP»,»X in a voXume of 40 .X. riXters were -ashed xn 
500 na of 2X SSC. 0.5* SDS at room temperature for X5 ain. 



100 



WO 93/07278 



PCT/US92/08476 



J 



Radiolabeled DNA probes were made using a BRL random prime 
labeling system and unincorporated counts removed using Nick 
Columns (Pharmacia) . Filters were exposed overnight to Kodak 
X-Oraat AR X-ray film with (DuPont) Cronex Lightning Plus 
intensifying screens at -80-C. Plaques showing hybridization 
with the pith-derived probe and not the seed-derived probe were 
plaque purified for further characterization. 

EXAMPLE 11. Isolation of Genomic Clones 

Genomic DNA from Funk inbred maize line 211D was 
isolated as described by Shure et al., Cell, 35:225-233 (1988). 
The DNA was partially digested with Sau 3A and subsequently 
size fractionated on 10-40% sucrose gradients centrifuged in a 
Beckman SW40 rotor at 22,000 rpm for 20 hours at 20»C. 
Fractions in the range of 9-23 Kb were pooled and ethanol 
precipitated. Lambda Dash II (Stratagene) cut with Bam HI was 
used as described by the supplier. The library was screened 
unamplified and a total of 300,000 pfu were screened using the 
conditions described above. The library was probed using pith- 
specific (TrpA) cDNA clone 8-2, P CIB5600 which was identified 
in the differential screen of the cDNA library. Isolated 
clones were plaque purified and a large scale phage preparation 
was made using Lambdasorb (Promega) as described by the 
supplier. Isolated genomic clones were digested with Eco RI 
and the 4 . 8 kb EcoRI fragment was subcloned into Bluescript 



'* vector (Stratagene) . 
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DMA sequence and Computer Analysis 
"ieotide sequencing »as performed usrng the drdeoxy 
cB ain-ter»inacion method disclosed in Sanger et al.. 
fN AS. 74 =3463-3467 ,1.77,. Sequencing primers were synthes.sed 
He Applied Biosystems model 380B D»A synthesiser usrn, 
standard conditions. Sequencing reactions were carried out 
using the Sequenase system <0S Biochemical Corp.,. Gel 
analysis was performed on ,0 cm gels of 6* polyacrylam.de 
, M urea in Tris-Borate-KOTA buffer <BBX Gel-Hix 6, . Analyses 
of seances and comparison with seances in GenBan* were done 
using the O. of Wisconsin Genetic Computer Group Sequence 
Analysis Software (UWGCG) • 

Mapping the Transcriptional start Site 
P^er extension .as carried according to the procedure 
of Metraux et al., PMAS.. 86 = 886-800 UM» . Briefly. 30 ug of 
«^ pith total BNA -ere annealed with the primer in SO - 

-, « an n KC1 3 mM MgC12 <RT buffer) by heating to 
Tris pH 7.5, 40 ra« j ran y 

' ,. ... i ? . c Tbe FMA/pnmer 

80-c for 10 minutes and slow cooling to 42 C. 

*ix was allowed to hybridise overnight. Additional BT buffer 
« co 8 mM, BSA to 0.1 m,/ml. B*Asin at 4 U/»l and dHTP-s at 1 
m each were added. Then 8 units AMV reverse transcriptase were 
aCded and reaction placed at 37-C for one hour. The primer used 
was 5- -CCGITCGITC CTCCTTCGTC GAGG-3' , which starts at +90 bp 

relative to the transcription start. See Fig. 29A. A 
extension reaction was generated using the 4.8 Kb genomic clone 
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to allow determination of the transcriptional start site. The 
sequencing reaction was carried out as described in Example 12 . 

RNase protection was used to determine if the the 371 
bp sequence from +2 bp to +373 bp (start of cDNA) was 
contiguous or if it contained one or more introns. A 385 bp 
Sphl-Ncol fragment spanning +2 bp to +387 bp relative to 
transcriptional start see Fig. 29B was cloned into P GEM-5Zf (+) 
(Promega) and transcribed using the Riboprobe Gemini system 
(Promega) from the SP6 promoter to generate radioactive 
antisense RNA probes as described by the supplier. RNase 
protection was carried out as described in Sambrook et al. 
PBR322 (cut with Hpall and end labelled with 32P-dCTP) and 
Klenow fragment were used molecular weight markers. Gels were 
6% acrylamide/7M urea (BRL Gel-Mix 6) and were run at 60 watts 
constant power. 

EXAMPLE 14. Genomic Southern Blots 

Genomic DNA was isolated from maize line 211D using the 
procedure of Shure et al. ( supra . 8 ug of genomic DNA were 
used for each restriction enzyme digest. The following enzymes 
were used in the buffer suggested by the supplier: BamHI, 
EcoRI, EcoRV, HindHI, and Sad. Pith cDNA clone number 8-2 
was used for estimating gene copy number. The digested DNA was 
run on a 0.7% agarose gel using Tris-Borate-EDTA buffer system. 
The gel was pretreated with 250 mM HC1 for 15 min. to 
facilitate transfer of high molecular weight DNA. The DNA was 
transferred to Nitroplus 2000 membrane and subsequently probed 
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• . nB1 a-2 The blot was washed as described 
with the pith cDNA 8-2. Tne 

in Example 10 . 

EXAMPLE 15. PCR Material and Methods 

reactions were preformed using the GeneAmp DNA 
plication reagent 3cit and AmpliTag recombinant Tag DNA 
polmerase (Perfcin Elmer Cetus) . Reaction condition were as 
follows: 0.1 to 0.5 uM of each of the two primers used per 
reaction, 25 ng of the pith 4.8 Kb EcoRl fragment in 

„ described by the supplier 
Bluescript, plus the PCR reaction mix describe 

f or a total volume of 50 uL in 0.5 mL GeneAmp reaction tube 
( Per*in Elmer Cetus, . The DNA Thermal Cycler (Perfcin Elmer 
Cetus, using the Step-Cycle program set to denature 
60 s, anneal at 55-C for 60 s, and extend at 72-C for 45 s 
Allowed by a 3-s-per-cycle extension for a total of 30 cycles 
Tfte following primer sets were used: X. 83 X 84, -42, bp 

. ™ a« v 75 +239 bp to +372 bp. These are marked 
+258 bp; and IV. 40 X 75, r»p 

on Fig. 24. 

^j^^ isolation of a Pith-Preferred Gene. 

A cDNA library derived from pith mRNA cloned into 

^ first strand cDNA derived from 

Lambda Zap and screened using first stran 

either pith or seed mRNA. Clones which hybridized with only 
t he Pith probe were plague purified and again screened. Clones 
passing the second screen were used as probes in northern blots 
containing RNA from various maize tissues. 
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EXAMPLE 17, Gene Structure and Sequence Analysis. 

The 1.2 Kb insert of the cDNA clone 8-2 was sequenced 
using the dideoxy method of Sanger et al., supra. Likewise, 
the genomic equivalent contained on a 4 . 8 Kb EcoRI fragment in 
Bluescript denoted as pCIB5601, was sequenced. This 
information revealed that the genomic copy of the coding region 
spans 1.7 Kb and contains five introns . The mRNA transcript 
represents six exons. This is shown in Fig. 24. The exons 
range in size from 43 bp to 313 bp and the introns vary in size 
from 76 bp to 130 bp. The entire sequence of the gene and its 
corresponding deduced amino acid sequence are shown in Fig. 24. 

This gene encodes a protein of 34 6 amino acids with a 
molecular mass of about 38 kD. As illustrated in Table 1, the 

protein shows 62% similarity and 41% identity with 
the subunit protein of Pseudomonas aeruginosa and has high 
homology with trpA proteins from other organisms. 

Table 1 

Conservation of TrpA sequences between a maize TrpA gene and 
other organisms. 

Organisms" % amino acid % amino acid 

compared Similarity Identity 



Haloferax volancii 56.4 36.1 

Methanococcus voltae 58.1 35.1 



62.5 41.8 
61.4 33.3 



Pseudomonas aeruginosa 
Neurospora crassa 
Saccharomvces cerevisiae 56.7 36.1 
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;i;i:;;i;;~g^pin;s, i*—*, d- e . **. -q. s= T 

Similarities and indentities were done using the GAP program 
from "WGCG. wford ^ ^ .„ Rev Microbiol., 43:567-600 
(1989) , incorporated herein by reference, found regions of 
conserved amino acids in bacterial trpA genes . These are am.no 
acids 49 to 58, amino acids 181 to 184, and amino acids 213 to 
216, with the rest of the gene showing greater variability than 
is seen in the TrpB sequence. An alignment of known trpA 
proteins with the maize TrpA protein (not shown, illustrates 
that the homology between the maize gene and other trpA 
proteins is considerable. Also, it is comparable to the level 
of homology observed when other TrpA proteins are compared to 
each other as described in Crawford et al., supra. 

To determine the location of the transcription start 
site and whether or not there were introns present in this 
region, four polymerase chain reaction (PGR) generated 

* «. io9 ho to 427 bp from the region -429 bp to 

fragments of about 122 bp to i »p 

+ 372 bp were used for northern analysis. The results of the 
northerns showed that PGR probes II, HI. XV hybridized to pith 
total RNA and PGR probe I did not hybridize. This indicated 
that the transcription start was in the -69 bp to +90 bp 
region. To more precisely locate the transcriptional start 
site, primer extension was employed. Fig. 29A shows that when 
a primer (#73) located at +90 bp relative to the 
transcriptional start is used for primer extension, the 
transcriptional start site is located at +1, 1726 bp on the 
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genomic sequence . 

The first ATG from the transcriptional start site is at 
+114 bp. This is the ATG that would be expected to serve as 
the site for translational initiation. This ATG begins an open 
reading that runs into the open reading frame found in the cDNA 
clone. The first 60 amino acids of this predicted open reading 
frame strongly resemble a chloroplast transit peptide. See 
Berlyn et al. PNAS , 86:4604-4608 (1989) and Neumann-Karlin et 
al., EMBO J. f 5:9-13 (1986). This result suggests that this 
protein is targeted to a plastid and is likely processed to 
yield the active protein. Transient expression assays in a 
maize mesophyll protoplast system using a maize optimized B.t. 
gene driven by the trpA promoter showed that when the ATG at 
+114 bp is used as the fusion point, the highest levels of 
expression are obtained. Using either of the next two ATGs in 
the sequence substantially reduces the level of expression of 
the reporter gene. The ATG at +390 bp gave some activity, but 
at a much lower level than the +114 ATG, and the ATG at +201 bp 

gave no activity. 

Athough a number of TATA like boxes are located 
upstream of the upstream of the transcriptional start site at 
+1 bp, the TATAAT at -132 bp is most like the plant consensus 
of TATAAA. See Joshi, Nuc . Acids Res. , 15:6643-6653 (1987). 
The presumptive CCAAT like box was found at -231 bp . The 
nucleotide sequence surrounding the ATG start (GCGACATGGC) has 
homology to other maize translation starts as described in 
Messing et al., Genetic Engi neering of Plants: An Agricultural 
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active , plenum Press, pp. 211-227 (1983), but differs fro. 
that considered a consensus sequence in plants (AN^TCGC) . 
See, Ooshi, above. The presumptive poly (A) addition signal is 
located at 3719 bp (AATAAA) on the genomic sequence, 52 bp from 
the end of the cDHA. The sequence matches known sequences for 
maize as described in Dean et al., Nnc Acids Res. , 
14:2229-2240 (1986), and is located 346 bp downstream from the 
end of protein translation. See Dean et al., Nnc Acids Res. ., 
14:2229-2240 (1986). The 3' untranslated sequence of the cDNA 
ends at 3775 bp on the genomic sequence. 

Fig. 28 shows a Southern blot of maize 211D genomic DNA 
with the approximate gene copy number as reconstructed using 
pith gene 8-2 cDNA. From the restriction digests and 
reconstruction there appear to be 1-2 copies of the gene 
present per haploid genome. There do not appear to be other 
genes with lower levels of homology with this gene.. Therefore, 
this represents a unique or small member gene family in maize. 

EXAMPLE 18. RNase Protection 

The structure of the 5' end of the mFHA »as determined 
using «.». protection. The FM.se protection was carried out 
using a probe representing 385 nt from +2 hp to +387 bp. Th,s 
region from the genomic clone was placed in the RHA 
transcription vector pM-Slf <+> and a 32P labelled RNA probe 
generated using SP6 polymerase. The probe and the extra bases 
from the multiple cloning site produce a transcript of 461 nt. 
The probe was hybridised with total pith » and subsequently 
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digested with a mixture of RNase A and Tl and the protected 
fragments analyzed on denaturing polyacrylamide gels. Analysis 
of the gels shows a protected fragment of about 355 nt and 
another fragment of about 160 nt. See Fig. 29B. 

The fact that primer extension using a primer (#73) at 
+80 bp produces a product of 90 NT in length argues that the 
5' end of the transcript is located at position +1 bp. Primer 
extension from a primer in this region produces a product, so 
one would expect this also to be detected by the RNase 
protection assay. This primer is located in the 5' region of 
the RNase protection probe. The cDNA clone contains sequences 
present in the 3' end of the RNase protection probe and hence 
were expected to be protected in this assay. Since only one 
band is present on the gel which could account for both of 
these sequences, we are confident that the protected fragment 
is indeed the larger band and that the smaller single band is 
an artifact. If there were an intron in this region, fragments 
from each end would be present in the probe, and hence would be 
detectable on the gel. Of the two bands seen, one of them 
appears to represent the entire 5' region, therefore we do not 
believe that there is an intron located in this region. 

EXAMPLE 19. Complementation of coli TrpA Mutant with the 

Pith cDNA 8-2 

Ej. coli strain CGSC strain 5531 from the E. coli 
Genetic Stock Center, Yale University (O.H. Smith lab strain 
designation, #M5004) with chromosomal markers glnA3, TrpA9825, 
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Sentet., 137:131-142 ,1.75,. - trans.or.ed with either 
pith (TRpA, cDHA .-2 or Bluescript placid (Stratagene, as 
described in sambrooz at .1.. supra. The transferors 
containing the TrpA 8-2 had the ability to grow without 

the presence of tryptophan on minimal medium whereas the 

RiuescriPt (Stratagene) plasmxd or 
transformants with the Bluescript. 

transformed control were not able to grow without tryptophan 
The cells transformed with the maize TrpA gene grew very slowly 
with colonies visible after seven days growth at room 
temperature. All strains were grown on * minimal medium 
supplemented with 200 ug/ml glutamine, 0.01 ug/ml thiamxne and 
with or without 20 ug/ml tryptophan. All transformants were 
Cheered for the presence of the appropriate plasmid by 
restriction enzyme analysis. Colonies growing in the absence 
of tryptophan all contained clone 0-2 containing the cDNA for 
the putative maize TrpA gene, as confirmed by Southern 
hybridization .data not shown, . These results support the 

1 . ^ a mai2e tryptophan synthase subunit A 
conclusion that this is the maize tryptop 

protein . 

EXMjPIg 20. Gene Expression 

^ expression pattern of the pith-preferential gene 
throughout the plant was examined. Different maize genotypes 
we re also exa.ined for patterns of expression of 
The following tissues were used as the source of BHA for these 
studies: upper, middle, and lower pith, brace roots, ear 
st .,„„ i OM er Pith, 10 day 

shank, cob in genotype 5N984; upper, middle, lower P 
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old leaves, 14 day old roots and pith from the entire plant in 
genotype 211D, and seed from genotype 211D which had been 
harvested at weekly intervals one to five weeks 

post-pollination. Lower pith is derived from, i.e. constitutes 
the two internodes above brace roots; middle pith is derived 
from the next three internodes; upper pith represents the last 
two internodes before the tassel in 60 and 70 day plants. Only 
two internodes were present in 39 day old plants and three 
internodes for 46 day old plants. Northern blot analysis shows 
that transcripts hybridizing with a probe derived from the pith 
cDNA accumulate rapidly in young pith and young leaf. As the 
age of the plant increases and one moves up the stalk, there is 
a significant decrease in the amount of transcript detected. 
See Figs. 25A-D. At no time is message from this gene detected 
in seed derived RNA, either total RNA or poly A+ RNA. See Fig. 
26. Transcript is also detected in root, earshank, and sheath 
but not at the high levels detected in the pith and young leaf 
tissues. See Figs. 25B, 25C. Some message is detected in 
brace roots, but only at a very low level. See Fig. 25D. Six 
maize undifferentiated callus lines were analyzed by northern 
blot analysis and no expression was found for this gene (data 
not shown) in any callus sample. The level of expression of 
this gene is extremely high since a very strong signal to a 
probe from TrpA gene 8-2 can be detected in pith and leaf as 
little as two hours after exposure of the blot to film (Fig. 
25A) . The amount of mRNA made is comparable to that derived 
from the maize phosphoenolpyruvate carboxylase gene disclosed 
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. al Moj - Biology , 12:579-589 (1989), 

in Hudspeth et al . , Utt -» incorporated 

another highly expressed maize gene. Hudspeth 

ner ein by reference. 

The expression pattern of this gene x 

w v, * n the lower and middle pxth 
constant. Expression is very high xn the low 

, tha n 60 days old and decreases rapidly near the 
of plants less than 60 cays oxa 

„P of the Plant. « «- Pl-t reaches -aturity, 

old, the expression drops to nearly 
^ in the lower Pit, an, earshan*. The acc-«- of 

lflaf ts nearly as high as that seen xn 
transcript in young leaf is nearly undetec tahle 
lower pith but expression decreases rapxdly and 

xowet p-u^ , ,p was found to 

in leaves over 40 days of age. Expr.ss.on » leaf 
be variable depending on the season when it is grown. 

Examples 21-3, set forth helow are direoted to the 
isolation, characterization and expression analysis of a 
pollen-specif ic propter according to the present rnvent.on. 

Identification of pollen-specific proteins 

p.a-nole 21- Maize Plant Growth 

Plants <*.= »ays Fun* inhred 2U» -are grown 
«. seed in a ver»lculite,sand fixture in a greenhouse coder a 
16 hour light/ 8 hour dark regime. 
Example 22. Total Pollen Protein Isolation 

Mature pollen was isolated from maize plants at 

sieved to remove debris , 

time of maxima pollen shed. It was 

frozen in liquid nitrogen, and a 3-, -I volume of 

T-1S0 P- g-s heads. 4. -X of grinding huffer *- 
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DTTf 0.1% SDS, 100 mM Hepes P H 8) was added and the mixture was 
ground again. The glass beads and intact pollen grains were 
pelleted by low speed centrifugation, and mixture was clarified 
by centrifugation at 10,000 g for 15 minutes. Protein was 
precipitated from the supernatant by addition of acetone to 

90%. 

Example 23. Pollen Exine Protein Isolation 

Exine Protein was isolated from maize 211D shed pollen 
as described in Matousek and Tupy, J., Plant Physiology 
119:169-178 (1985). 
Example 24. Leaf Protein Isolation 

Young leaves (about 60% expanded) were cut from the 
maize plant the midrib removed. Total protein was isolated as 
for pollen, except that the material was not frozen and 
grinding was in a Waring blender without glass beads. 
Example 25. Kernel Protein Isolation 

Ears with fully developed, but still moist kernels were 
removed from the plant and the kernels cut off with a scalpel . 
Total protein was isolated as for leaves. 
Example 26. Gel Electrophoresis of Maize Proteins 

Pollen, leaf and kernel proteins were separated on SDS 
polyacrylamide gels as described in Sambrook et al, Molecular 
rm^nn. A Lah^n.v Manual, Cold Spring Harbor Laboratory 
Press: New York (1989) . Following staining by coomasie blue, 
protein bands from pollen, leaf and kernel were compared and 
abundant proteins of approximately 10 kD,13 kD, 20 kD, 45 kD, 
55 kD and 57 kD were determined to be pollen specific. 
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xaentif ication of Pollen-Specific cDNA clones 
Example 27 . Partial seance Oetermination of Pollen-Specrf rc 

'""^Protein bands determined to be pollen-specific were 

pnrified by electroblotting from the ^^ o ^. 

• _ o T Biol. Chem. 261. xuuj^ 
PVDF membrane (Matsudaira, P., J. . **- 03 " - 

(l M7, , or by reverse phase HP^. H-cerninal serene e of t 
pnrified proteins was determined by automated Edman egradat.on 
with an Applied Biosystems 470* gas-phase sequencer 
Pbenylthiohydantoin «PTH, amino acids -ere identified usmg 

internal 



Applied Biosystems 120A PTH analyser. To obtain internal 



Applied Diusyo^- 

„™t„ins were digested with endoproteinase Lys-C 
sequence, proteins were g ^ ^ ^ 

.Boehringer Mannherm, m 0.1 M . P 



« room temperature using an enzyme -.substrate ratio of .10. 
Kesulting Peptides were isolated by HPiC using an Ao.ap.re 
columl eluted with a linear acetonitrile/isopropano 
ratio, gradient (0 to 60%) in 0.1% TEA. Sequence of isolated 
Lys -C peptides was determined as above. The following 
Lys c pept p 0ll en-sp.clfi= protein 

sequences were determined for the ijku P 



N-te minus : 
LysC 61: 

LysC 54: 
LysC 49: EHGGDDFSFTLK 
LyS C 43: EGP TGTWTLDTK 



TTPLTFQVGKGSKPGHL1LTPNVATI 
KPGHLILTPNVATISDVVIK 
SGGTRIADDVIPADFK 



Example 28 . synthesis of Oligonucleotide Probes for 
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Pollen-Specific cDNAs 

Regions of peptide sequence in the 13kD protein with 

low codon redundancy were selected, and suitable 
oligonucleotide probes for the gene encoding these regions were 
synthesized on an Applied Biosystems 380A synthesizer. The 
following oligonucleotides were synthesized: 

Oligo #51 5'-AA ATC ATC ACC ACC ATG TTC-3' 



Oligo # 58 5'-CC TTT ACC CAC TTG AAA-3' 



where the columns of nucleotides represent bases that were 
incorporated randomly in equal proportions at the indicated 
position in the oligo. Oligo #51 encodes the amino acid 
sequence EHGGDDF found in peptide LysC 49, and Oligo #58 
encodes the amino acid sequence FQVGKG found in peptide 
N-terminus. Use of these mixed oligonucleotides to screen a 
CDMA library for the pollen-specific gene will be described 
below. 

Example 29. Construction of a maize pollen cDNA library 
Total maize BNA from maize 211D shed pollen was 
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isolated as described in Glisen et al, Biochemijltrv 
i3-2633- 26 37 ,1.7.. . Poly « mRKA — P"*"- 
,s described in Sambrook et al. Using this mSKA, d» — 
psepared using a CDMA synthesis sit purchased from Promega, 

, v4t . T K e EcoRI linkers 

following protocols supplied with the kit . The Ec 

were added to the cDHA and it -as lifted into arms of the 
cloning vector lambda 2 ap, purchased from Stratagene and using 
the protocol supplied by the msnuf acturer . The ligation 
product was packaged in a lambda packaging extract also 
purchased from stratsgene, and used to infect L coli BB4 

cells. 

Example 30 . Isolation of pollen-specific cDHA clones 

The maize pollen cDNA library was probed using the 

^ flr i f 4 C f or the 13kD protein 
synthetic oligonucleotides probes specific for th 

gene , ss described in Sambrook et al. Briefly, about 
phage plagues of the pollen cOKA library were plated and Irfted 
to nitrocellulose filters. The filters were probed uszng 
oligonucleotides .51 and #50 which had been 32P end-labeled 

kinase The probes were hybridized to 
using polynucleotide kinase. J. 

the filters at low stringency <50 degrees C in 1M MCI. 10% 
dextrsn sulfate, 0.5* SDS, . washed 30 minutes at room 
temperature and then 30 minutes at 45 degrees C in 6X SSC. . 
SD S. and exposed to X-ray film to identify positive clones. 
Putative clones -ere purified through four rounds of plague 
hybridization. Three classes of cD»A clones -ere isolated. 

sP n o v-v. and 1.8 kb. Type I 1 

Type I contained EcoRI fragments of 0.2 kb and 

n k Yb 0 5 Kb and 1.0 kb, and 
contained EcoRI fragments of 0.6 to, 0.3 » 
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Type III contained an EcoRI fragment of 2.3 kb. 

Example 31. Characterization of Pollen-specific cDNA clones 

The EcoRI fragments of the Type II cDNA clone were 
subclone* into the plasmid vector pBluescript SK + , purchased 
from Stratagene. See Fig. 30. The 0.6 kb fragment in 
pBluescript was named II-. 6, the 0.5 kb fragment in pBluescript 
was named II-. 5 (later renamed P CIB3169) and the 1.0 kb 
fragment in pBluescript was named II-1.0 (later renamed 
PCIB3168). AS will be described below, the 0.5 kb and 1.0 kb 
fragments encode the maize pollen-specific CDPK gene. RNA from 
anthers, pollen, leaf, root and silk was denatured with 
glyoxal, electrophoresed on a 1% agarose gel, transferred to 
nitrocellulose, and probed separately with the three EcoRI 
fragments that had been labeled with 32P by random primer 
extension as described in Sambrook et al, Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press: New 
York (1989) . The blots were exposed to X-ray film, and an mRNA 
band of approximately 1.5 kb was identified with the 0.6 kb 
fragment probe, while the 0.5 and 1.0 kb fragments hybridized 
to an approximately 2.0 kb mRNA. In all cases hybridization 
was only seen in the pollen RNA lane, with the exception that 
the 0.6 kb fragment showed a slight signal in anther mRNA. The 
conclusion from these data was that the original cDNA clone was 
a fusion CDNA molecules derived from two different mRNAs . The 
0.6 kb fragment was a partial cDNA of a 1.5 kb pollen-specific 
mRNA, and this mRNA encodes the peptides LysC 49 and 
N-terminus. The 1.0 and 0.5 kb fragments comprise a partial 
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cdna or a 2 .0 >=b P.ll=n-.P.=i«c — • «— 1— " PeP " d " 
^ 01 igonueleotide proves used for probes. This conclusion was 
verified when Che fragments were sequenced using the dideoxy 
chain termination method as described in sambroo* et .1. The 
cDNA sequence is shown in Fig. 31. 

Example 32. Determination of specificity of mRNA expression 

To determine if the 2 . 0 kb RNA represented by cDNA 
clones PCIB3169 and P CIB3168 were present only in pollen, total 
RNA was isolated from maize 211D roots, leaves, pollen, anthers 
or silks. The RNAs were denatured with glyoxal, 
electrophoresed on a 1% agarose gel, transferred to 
nitrocellulose, and probed with 32P-labeled EcoRI insert from 
plasmid PCIB3168 or P CIB3169, all using standard techniques as 
described in Sambrook et al. Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory Press: New York (1989) . 
Exposure of this blot to photographic film demonstrates that 
the gene represented by these two clones is only 
transcriptionally active in the pollen (Fig. 32) - 

Identification of a Pollen-Specific Promoter 
Example 33. Construction of a Maize Genomic DNA Library 
Genomic DNA from maize line 211D young shoots was 

•v^h < n «?h™ et 1. Cell 35:225-233 (1983). 
isolated as described in snure et a, isSii 

The DNA was provided to Stratagene, where a genomic DNA library 
was constructed by cloning Sau3A! partially digested DHA into 
Stratagene' s Lambda Dash cloning vector. 

Example 34. Genomic DNA Blot Hybridization to Determine Gene 
Copy Number. 
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Genomic DNA from maize line 21 ID was digested with a 
number of restriction enzymes, the individual digests 
electrophoresed on an agarose gel, transferred to 
nitrocellulose, and probed with 32P-labeled EcoRI insert from 
plasmid pCIB3168 (1.0 kb fragment), pCIB3169 (0.5 kb fragment) 
or clone II-. 6 using standard techniques described in SambrooJc 
et al. More than 10 bands were detected by the II-. 6 probe on 
most digests, indicating that this cDNA is derived from a 
large, multigene family. Probing with the 1.0 kb fragment 
detected from 3 to 6 bands, and probing with the 0.5 kb 
fragment detected only from 1 to 3 bands which were a subset of 
those detected by the 1.0 kb fragment. Due to the smaller gene 
family size detected by the 1.0 kb and 0.5 kb fragments, it was 
decided to attempt to isolate the genomic clone corresponding 
to them . 

Example 35. Isolation of a pollen-specific genomic clone 

The Stratagene maize 211D genomic library was screened 
by probing plaque lifts with 32P labeled inserts from plasmid 
pCIB3168 (1.0 kb fragment) and pCIB3169 (0.5 kb fragment) using 
standard procedures as described in the Stratagene manual 
accompanying the library. Using this strategy, Lambda clone 
MG14 was isolated, and it hybridized to both probes. The 9.0 
kb BamHI fragment of MG14, which also hybridized to both 
probes, was subcloned into the BamHI site of pBluescript SK+ to 
create plasmid pCIB379. 1800 bp of pCIB379, in the region 
corresponding to the cDNA sequence, was sequenced as described 
above. Comparison of the cDNA and genomic sequences showed 
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only ,l, identity. PCXB379 insert represents a related 

pollen-specif i= gene, constructed in 

K second maize 211D genomic library was 
th . vector lambda GEH-11. purchased from Promega, using the 
pr ocedures descrihed. in the Pr=mega manual, screening this 
oo-amplified library as above yielded clone -11-1. 
hy bridized to both 0.5 and 1.0 kb probes. The 20kb HindXH 
ft agment of -11-1. which also hybridized to both probes was 
stoned into the HindXII site of pBluescript SK + to yield 

« . ,», o< DCIB3166 was determined 
PCIB3166. The DNA sequence of 4.1kb of P CIB3l 

(Fig . ,«, and after accounting for six introns in the genomic 
clone, was 100* identical to the cOKa sequence of pCIB3166 and 
PCTB3169. Comparison of the PCIB3166 sequence to the 
Genbank/EMBL database revealed that the 5' portion, through the 
3 exon. was 34 . S* identical to rat calmodulin-dependent pro tern 

, ixpArr ^\ while the fourth 
kinase II at the amino acid level (Fig. 33), whir 

thI ough seventh exons were 39.4* identical to rat calmodulin, 
see Pig- 34. Ho other pollen-specific Kinase has been 
described, and at the time this a protein combining kinase and 
calmodulin domains was unknown . Subsequently, Harper et al 

,. enlrt «H the cDNA sequence or 
science 252=951-954 (1991) have disclosed the 

a similar protein from soybean, although this gene is not 
pollen-specific in expression. Comparison of the soybean 
calcium-Dependent Protein Kinase <CCPK, and the maize pollen 
CDPK reveals 39* identity at the amino adid level, see Fig. 

Txlmple 36. Xdentif ication of the Promoter's Transcriptional 
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Start Site by Primer Extension 

Oligonucleotide PE51, with the following sequence was 

synthesized as a primer. 

5 ' -TGGCCCATGGCTGCGGCGGGGAACGAGTGCGGC-3 ' 
Primer extension analysis was carried out on polyA+ 
pollen mRNA as described in Metraux et al., PNAS USA 86:896-890 
(1989) . The transcription initiation site was determined to be 
between bases 1415 and 1425 on the partial sequence of pCIB3166 

shown in Fig. 36. 

Testing Promoter Function in Transgenic Plants 

Example 37. Construction of promoter vectors for plant 
transformation 

To demonstrate that the pollen CDPK promoter can drive 
expression of a linked gene in transgenic plants , a gene fusion 
of the pollen CDPK promoter to the Beta-glucuronidase gene of 
E. coli was constructed as follows. The lOkb BamHI fragment 
from lambda GEM11-1 containing the first exon and part of the 
first intron of the pollen CDPK gene plus 9 kb upstream of the 
gene was subcloned into the BamHI site of pBluescript SK+ to 
create plasmid pCIB3167. The 2.3 kb BamHI -Hindlll fragment 
from pCIB3167 was subcloned into the BamHI and Hindlll sites of 
pBluescript SK+ to create plasmid pSK105. The pSK105 was 
digested with Aval and Hindlll, and thel.75 kb Hindlll-Aval 
fragment was isolated on an agarose gel- A PCR reaction was 
run under standard conditions as described in Sambrook et al. 

■ 

using intact pSK105 as a template and the following primers: 
#42 : 5' -AGCGGTCGACCTGCAGGCATGCGATCTGCACCTCCCGCCG-3' 
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#43- 5 '-ATGGGCAAGGAGCTCGGG-3 

Th . pcR reaction products -ere digested with Ave! and 
sail and the resulting fragment isoiated on an agarose gel. 
prescript SK+ »aa digested with Hindlli and Sail. The 1-75 
Hindlll-Aval fragment. PCR derived Aval-Sail fragment, and 
pBluesoript vector with BindlH and Sail ends were iigated ,n a 
three way ligation to create plasmid pSKHO. 

A fusion of the promoter fragment in pSKHO to the 
Beta-glucuronidase <G0S. gene was created by digesting pSKilO 
with Hindi!! and sail, isoiatin, th. l.«b fragment on an 
agarose gei and iigating it into HindXII and Sail sites of 
PCIB3054, to create plasmid P KL2, a plasmid derived from PUC19 
containing the GOS gene followed hy Plant intron from the marre 
PEPC gene and a polyA signal from cauliflower mosaic vrrus . 
This promoter fusion was inactive in plants, probahly due to 
the pre sence of out of frame ATG codons in the leader seguence 

preceding the GOS gene ATG. 

A function fusion of the promoter was created by 
digesting pKL 2 with Xhal and Sail to remove the previous fusion 
.unction. A new fusion junction was produced in a PCR reaction 
using PSK105 as a template and the following primers: 

#SK50 : 5' -CCCTTCAAAATCTAGAAACCT-3 ' 

# SK4 9 : 5' -TAATGTCGACGAACGGCGAGAGATGGA- 3 ' 

. , „ aB( . oH with Xbal and Sail and 
The PCR product was digested wicn w« 

purified on an agarose gel . The purified f ragment was iigated 
into the Xbal and Sail sites of pKL2 to created plasmid 
PCIB3171. This Plasmid contains a functional fusion of pollen 
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CDPK prompter and GUS which directs expression the GUS gene 

exclusively in pollen. 

To create a vector containing the pollen CDPK 
promoter-GUS fusion suitable for use in Agxobacterium 
tumefaciens-mediated plant transformation, the fusion gene was 
isolated from pCIB3171 by digestion with Hindlll and Sail. The 
resulting fragment was ligated into the Hindlll and Sail sites 
of pBHOl (purchased from Clontech) to create plasmid pCIB3175. 
Example 38. Production of Transgenic Plants 

PCIB3175 was transformed into Agrobacterium tumefaciens 
containing the helper plasmid pCIB542, and the resulting 
culture used to transform leaf disks from tobacco shoot tip 

■ 

cultures as described by Horsch et al., Science 227:1229-1231 
(1985) except that nurse cultures were omitted and selection 
was on 100 mg/1 kanamycin. Transgenic plants were regenerated 
and verified for presence of the transgene by PCR. 
Example 39. GUS Gene Expression Analysis 

Pollen from primary transf ormants and their progeny 
were analyzed histochemically for expression of the GUS gene as 
described by Guerrero et al., Mni . Gen. Genet. 224:161-168 
(1990) . The percentage of pollen grains expressing the GUS 
gene, as demonstrated by blue staining in the X-gluc buffer, is 
shown in the table below. 



Plant Number % Blue Pollen 

PP1-51 28% 

PP1-54 54% 

PP1-55 none 

PP1-61 very few 

PP1-63 51% 
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PP1-67 
PP1-80 
PP1-83 



15% 
10% 
12% 



Primary transf ormants in which a single pollen CDPK 
promoter-GUS gene was integrated would produce a maximum 50% 
GUS positive pollen due to segregation of the single gene. 

Flouometric GUS assays were done on pollen, stem, root, 
leaf and pistil tissue of selected plants to demonstrate the 
specificity of pollen CDPK promoter expression. Assays were 
performed as described in Jefferson, Plant Mol . Biol, 
14:995-1006 (1990), and GUS activity values are expressed as 
nmoles MU/ug protein/minute. 

Net GUS 
Activity 



0 
0 

0.05 
0.01 
0.22 

0 
0 

0.03 
0 

0.58 

0 
0 
0 
0 

0.55 



Plant 
number 


Tissue 


GUS Activity 


Untransformed 

Plant GUS 
Activity 


PP1-51 


stem 
leaf 
root 

pollen 


0.01 
0 

0.15 
0.02 
0.24 


0.02 
0 

0.10 
0.01 
0.02 


PP1-54 


stem 

leaf 

root 

pistil 

pollen 


0.01 
0 

0.13 
0.01 
0.60 


0.02 
0 

0.1 

0.01 

0.02 


PP1-63 


stem 

leaf 

root 

pistil 

pollen 


0.01 
0 

0.07 
0.01 
0.57 


0.02 
0 

0.1 

0.01 

0.02 



Examples 40-50 are directed primarily to the preparation of 
chimeric constructs, i.e. recombinant DNA molecules, containing 
constitutive, tissue-preferred, or tissue-specific promoters 
operably linked to an instant gene, insertion of same into 
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vectors, production of transgenic platns containing the 
vectors," and analysis of expression levels of proteins of 

the transgenic plants . 

EXAMPLE 40: CONSTRUCTION OF MAIZE OPTIMIZED BT 

TRANSFORMATION VECTORS 

To demonstrate the effectiveness of the synthetic Bt 
crylA(b) gene in maize, the PepC and pith specific promoters 
are fused to the synthetic Bt crylA(b) gene using PCR. 
Oligomers designed for the PCR fusions were: 



(PEPC) 

KE99A28 " 5 ' -TGCGGTTACC GCCGATCACATG-3 ' 

KE97A28 = 5 ' -GCGGTACCGC GTCGACGCGG ATCCCGCGGC GGGAAGCTAAG - 3 ' 



(PITH) 

KE100A28 = 5 ' -GTCGTCGACC GCAACA-3' 

KE98A2B = 5' -GCGGTACCGC GTTAACGCGG ATCCTGTCCG ACACCGGAC-3' 
KE104A2B = 5' -GATGTCGTCG ACCGCAACAC-3' 

KE103A28 = 5' -GCGGTACCGC GGATCCTGTC CGACACCGGA CGGCT-3' 

PCR primers are designed to replace the Nco I sites in 
the 5' untranslated leader region of each of these tissue 
specific genes (containing ATG translatiohal start sites) with 
Bam HI sites to facilitate cloning of the synthetic crylA(b) 
gene into this Bam HI site. Subsequent construction of vectors 
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containing the tissue specific promoters fused to the synthetic 
crylA(b) gene and also containing the 35S:PAT:35S marker gene 
involves several intermediate constructs. 
1. pCIB4406 (35S:syntbetic-cryIA(b) :pe P C ivs#9:35S) 

PCIB4406 contains the 2 Kb Bam HI\Cla I synthetic 
crylA(b) gene fused with the CaMV 35S promoter (Rothstein et 
al., Gene 53:153-161 (1987)). The gene also contains intron #6 
derived from the maize PEP carboxylase gene (ivs#9) in the 3' 
untranslated region of the gene, which uses the CaMV 3' end. 
(PNAS_USA, 83:2884-2888 (1986), Hudspeth et al., Plant 
Moj ecular Biology , 12: 579-589 (1989)). P CIB4406 is ligated 
and transformed into the "SURE" strain of coli cells 
(Stratagene, La Jolla, CA, as described above. One mutation is 
found in P CIB4406's crylA(b) gene at amino acid #436 which 
resulted in the desired Phe being changed to a Leu. P CIB4406 
is fully active against European corn borer when tested in 
insect bioassays and produces a CrylA(b) protein of the 
expected size as determined by western blot analysis. 
2. pCIB4407 (35S:synthetic-cryIA(b) :pe P C ivs#9:35S + 

35S:PAT:35S) 

PCIB4407 is made from an approximately 4 Kb Hind 
ZIXNBco RI fragment containing the 35S:PAT:35S gene, and the 
3.1 Kb\Hind III\Eco RI 35S : synthetic-cry IA(b> :35S gene from 
PCIB4406. PCIB4407 is ligated and transformed into "SURE"/ 
DHSalpha, and HB101 strains of E, coli using standard 
procedures (SambrooK et al.). The synthetic crylA(b) gene has 
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the same properties as its precursor P CIB4406. 
3. pCIB4416 (35S : synthetic-cry IA <b) :pe P C ivs#9:35S + 

35S:PAT:35S + 35S:Adh intron :GUS : 35S . ) 

PCIB4407 is cut with Eco RI and treated with calf 
intestinal alkaline phosphatase (CIP) under standard conditions 
(Sambrook et al.) to produce an about 7.2 Kb fragment that is 
ligated with a 3.4 Kb Eco RI 35S :Adh\GUS : 35S fragment to 
produce pClB4416. Ligations and transformations into "SURE" 
cells is as described above. The synthetic crylA(b) gene in 
PCIB4416 has the same properties as the gene in P CIB4406. 
4. pCIB4418 (35S: synthetic-cry IA(b) :pepC ivs#9:35S) 

PCIB4406 is digested with Apa I and Bam HI and treated 
with CIP . pCIB4406 is digested with Bam HI and Nsp I. 
P BS123#13 is digested with Nsp I and Apa I. A three-way 
ligation is made consisting of a 4.3 Kb Apa I\Bam HI fragment 
from PCIB4406, a 1.3 Kb Bam Hl\Nsp I fragment from P CIB4406, 
and a 170 bp Nsp I\Apa I fragment from P BS123#13 to form 
PCIB4418. The host Ej. coli strain for P CIB4418 is HB101. 
5. P CIB4419 (35S: synthetic-cry IA(b) :pepC ivs#9:35S + 

35S:PAT:35S + 35S:Adh intron :GOS:35S.) 
PCIB4416 and pCIB4418 are digested with Bst E II and 
Eco NI and fragments of P CIB4416 are treated with CIP. A 9.1 
Kb fragment from P CIB4416 ligated to a 1.4 Kb fragment from 
PCIB4418 to form pCIB4419. P CIB4419 transformed in HB101 
competent E^ coli cells demonstrates full activity in insect 
bioassays against European corn borer. 
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6. PCIB4420 <Pith:synthetic-cryIA<b, :PEPC ivs#9:35S + 

35S:PAT:35S) 

intermediate constructs in making P CIB4420 are pBTinl, 
P Btin2, P 4420Aand P Btin3. pBtinl (pith promoter: second half 
of the synthetic Bt gene ♦ 35S:PAT:35S) is made by luting the 
2 1 Kb Xba iMteo I Pi^ Promoter fragment from plasmxd 
pith (3-D with a 5.2 Kb Xba Ittoo I fragment from P CIB4407. 
P Btin2 is an intermediate construct containing the pith 
promoter modified with a 210 bp PGR fragment made using prxmers 
KE100A28 and KE98A2B listed above. The PGR reaction mix 
contains approximately 100 ng of a 2.1 Kb Bam HlXNco X pit* 
promoter fragment with 100 pmol of each oligomer, 200 nM of 
each dHTP, 1 X buffer (Getus) and 2.5 units of thermal stable 
polymerase. Since the Tm is relatively low (between 40- and 
50-C). PGR reactions are run with the following parameters: 
denaturation cycle: 94-C for 1 minute 
annealing cycle : 37-C for 1 minute 

i» . -if »c for 45 seconds (+ 3 seconds per 
extension cycle : 72 c tor *o 

cycle) 

number of cycles: 25 

SCR reactions are treated with proteinase K as 
ascribed above prior to cutting with Sai XXKpn I followed by 
phenolVchloroform extraotion and ethanol precipitation as 
described above. The 210 bp fragment is purified on a 2 % 
„„sieve gel and extracted from the gel using Millipore' s filter 
units. The 210 bp sal XVKpn I fragment is ligated to the 4.9 
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Kb Sal I\Kpn I fragment from pith(3-l> to make P Btin2. P 4420A 
(pith:synthetic-Bt:Pe P intron:35S + 35S:PAT:35S) is made with a 
three-way ligation consisting of a 700 bp Nsi I\Bam HI fragment 
from P Btin2, a 1.8 Kb Bam HI\Bst E II fragment from pCIB4418, 
and a 5.9 Kb Bst E II\Nsi I fragment from pBtinl. After P 4420A 
is made three mutations are discovered in P Btin2. A second PCR 
fragment is made to modify the Nco I site in the pith leader 
using primers KE104A28 and KE103A2B with Tm values around 65'C. 
the PCR reaction mix is identical to that listed above with the 
addition of glycerol to 20% to reduce mutations in G+C rich 
areas (Henry et al. f Plant Mole cular BiolQQV Reporter 
9 (2) -.139-144, 1991). PCR parameters are as follows: 
File I: 94 »C : 3 minutes , 1 cycle 
File II: 60»C : 1 minute 

94 »C : 1 minute 

25 cycles 
File III: 72 »C : 5 minutes, 1 cycle 

PCR reactions are treated as above and cut with 
restriction endonucleases Sal I and Kpn I. The 210 bp Sal 
I\Kpn I PCR (glycerol in the reaction) fragment is ligated to 
the 4.9 Kb Sal I\Kpn I fragment from plasmid pith (3-1) to make 
P Btin3. Sequence data on P Btin3-G#l shows this PCR generated 

fragment to be correct . 

P Btin3-G#l is used to make P CIB4420 (also called P 4420B 
"G#6") . PCIB4420 is constructed with a three-way ligation 
using the 700 bp Nsi I\Bam HI fragment from P Btin3-G#l, a 1.8 
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Kb Bam HlXBst E II fragment from P CIB441B, and a 5.9 Kb Bst . 
HXNsi I fragment from pBtinl. P GIB4420 is used in mesophyll 
protoplast experiments and demonstrates full activity of the 
synthetic crylA(b) gene against European corn borer. 
7 PCIB4413 <PEPG:synthetic-Bt (Phe mutation) :PEPC intr on: 35S . ) 

A fusion fragment is generated by PGR using primers 
KE99A28 and KE97A28 with a 2.3 KB Hind III\Sal I template from 
PGUS4.5. The PGR mix contains the same concentration of 
primers, template, dNTPs, salts, and thermal stable polymerase 
as described above. PGR reaction parameters are: 

denaturation cycle : 94 «C for 1 minute 

annealing cycle : 55-C for 1 minute 

„„ rlo . 72 . c for 45 seconds (+ 3 seconds per 
extension cycle . it <- lot 

cycle) 

number of cycles: 30 

After completion. PCR reactions are treated with 
proteinase K followed by phenolXchloroform extraction and 
ethanol precipitation as described above prior to cutting »rtn 
restriction endonucleases Bam HI and Bst E II - 

PCIB4413 is made with a three-way ligation using the 
2X0 bp Bam HI\Bst E II PC* fragment, a 4.7 Kb Bam HIXHind III 

o n vt, Hind llI\Bst E II fragment 
fragment from P CIB4406, and a 2.2 Kb Hxnd III\b« 

from pGUS4 .5 . 

8 PCIB4421 (PEPC:synthetic-cryIA(b) :PEPC intron:35S.) 

PCIB4421 is made to replace the synthetic crylA(b) gene 
containing the Phe mutation in P GIB4413 with the synthetic 
crylA(b) gene from pCIB4419. PGIB4421 is made by ligating a 
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5.2 Kb Bam HI\Sac I fragment from P CIB4413 with a 1.9 Kb Bam 
Hl\Sac I fragment from pCIB4419. 

9. pCIB4423 (PEPC : synthetic-cry IA (b ) :PepC intron:35S + 
35S:PAT:35S) 

The 2.4 Kb Bam HI\Hind III PEPC promoter fragment from 
PCIB4421 is ligated to the 6.2 Kb Bam HI\Hind III fragment in 
PCIB4420 to make pCIB4423. The Hind III site is deleted by 
exonucleases in the cloning of P CIB4423. P CIB4423 contains the 
synthetic crylA(b) gene under the control of the PEPC promoter, 
and the PAT gene under the control of the 35S promoter. 

10. Synthetic crylA(b) gene in Aarobacterium strains: 

Aarobacterium strains made with the synthetic crylA(b) 
gene allow transfer of this gene in a range of dicotyledenous 
plants. Aarobacterium vector P CIB4417 contains the 3.3 Kb Hind 
IIINEco RI 35S : synthetic-Cry IA <b) :Pe P C:ivs* 9: 35S fragment from 
PCIB4406 (Phe mutation) ligated to the 14 Kb Hind IIINEco RI 
fragment from pBIlOl (Clontech)'. Using electroporation, 
PCIB4417 is transferred into the a. tumefaciens strain LBA4404 

(Diethard et al., Nucleic Acids Research, Voll7 : #16 : 6747, 

1989.). 

200 ng of pCIB4417 and 40 ul of thawed on ice LBA4404 
competent cell are electroporated in a pre-cooled 0.2 cm 
electroporation cuvette (Bio-Rad Laboratories Ltd.). Using 
Gene Pulser-TM with the Pulse Controller unit (Bio-Rad) , an 
electric pulse is applied immediately with the voltage set at 
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« e t at 25 uF. After the pulse, cells 
2.5 kV, and the capacity set at ^ ur 

«-« i ml of YEB medium and shaken at 
are immediately transferred to 1 ml o£ x*. 

27 c for 3 hours before plating 10 ul on ABmin:Km50 plates. 
Mt er incubating at 28 C for approximately 60 hours colonies 
are selected for miniscreen preparation to do restriction 
enzyme analysis . The final M robacterlum strain is called 
pCIB4417 :LBA4404 . 

EXAMPLE 41: ELISB ANALYSIS OF TKANSFOR^ METffi PROTOPLASTS 

The presence of the crylA(b) toxin protein is detected 
b y utilizing enzyme-linked immunosorbent assay (ELISA) . ELISAS 
are very sensitive, specific assays for antigenic material. 
ELISA assays are useful to determine the expression of 
polypeptide gene products. Antiserum for these assays is 
produced in response to immunizing rabbits with 
gradient-purified Bt crystals [Ang et al., jppl led Enviro n. 
^ro^, 36:625-626 (1978)] solubilized with sodium dodecyl 
sulfate. ELISA analysis of extracts from transiently 
transformed maize cells is carried out using standard 
procedures (see for example Harlow, E., and Lane, D. xn 
-Antibodies: A Laboratory Manual", cold Spring Harbor 
Laboratory Press, 1988) . ELISA techniques are further 
described in Clark et al., Motho,, in Enzvmologv, 118:742-766 
(1 986>; and Bradford, AnaJ^iochem., 72:248 (1976). Thus, 
these procedures are well-known to those skilled in the art. 
The disclosure of these references is hereby incorporated 

herein by reference. 

ELISA assays are performed to detect the production of 

132 



WO 93/07278 PCT/US92/08476 

CrylA(b) protein in maize protoplasts. Protein produced is 
reported below as ng of Bt per mg total protein (ng Bt/mg) . 
Each construct was tested twice. 
pCIB3069 No detectable Bt (both tests) 
pCIB4407 21,900 ng Bt/mg total protein, 

21,000 ng Bt/mg total protein 
The transformed maize cells produce high levels, on the 
order of approximately 20,000 ng of Bt CrylA(b) protein per mg 
total soluble protein, of the Bt IP when transformed with the 
maize optimized Bt gene. The level of detection of these ELISA 
based assays is about 1 to 5 ng CrylA(b) protein per mg 
protein. Therefore, the maize optimized Bt gene produces as 
much as approximately a 20,000 fold increase in expression of 
this protein in maize cells . 

EXAMPLE 42: ASSAY OF EXTRACT FROM TRANSFORMED PROTOPLASTS FOR 
fcjXArlf Lit/ « %* SECTICIDAL ACTIVITY AGAINST EUROPEAN CORN BORER 

Western blot analysis is also performed using extracts 
obtained from maize cells which had been transiently 
transformed with DNA to express the maize optimized gene. When 
examined by western blots, this protein appears identical with 
the protein produced in E^ coli. In contrast, as demonstrated 
in Example 6 above, no detectable Bt crylA(b) insecticidal 
protein is produced by maize cells transformed with comparable 
vectors attempting to express the native Bt derived coding 
region . 

Qualitative insect toxicity testing can be carried out 
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*t>* orepared for each 
using harvested protoplasts. Suspensions are prep 

* irate is considered 

replicate tested in all bioassays. replicate 

•™.<<.<«mtlv higher mortality than the 
positive if it causes significantly nign 

controls. Tor example, replicates are tested for their 

activity against insects in the order tepido^ by using the 

E „opean com borer, Ostrinia nubilalis. One-hundred pi of a 

• n 1* Triton X-1O0 is pipetted onto the 
protoplast suspension in 0.1% Triton 

surface of artificial Blac* cutworm diet, <Bioserv, Inc., 
Prenchtown. W, F9240. in 50 mm X 10 mm snap-cap Petri dishes. 
Mt er air drying 10 neonatal larvae are added to each plate. 
Mortality is recorded after about 4 days . When this protein is 
red to European corn borers, it produces 100* mortality. 

43: EXPRESSION OF SYNTHETIC BT IN MAIZE MESOPHfLL 
PROTOPLASTS 

s - *t«t- t-he> isolation of corn 
The general procedure for the isoxatx 

mesopnyll protoplasts is adapted from Sheen et al., 

' 2:102 7-103S ,1990, . The protoplast transformation system 
us ed in Sheen et al. is modified by using PEG mediated 
transforation, rather than elaboration. That procedure, 
as «11 as changes made in the isolation procedure, is 



below . 

Maize Mesopnyll Protoplast Isolation/Transformation 

1. Sterilize and germinate corn seeds for leaf 
material. Seedlings are grown in the light at 25C. 

2 Surface sterilize leaf pieces of 10-12 day old 
seedlings with 5% Cloro* for 5 minutes followed oy several 
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washes with sterile distilled water. 

3. Aliquot enzyme solution (see recipe below); 25 

ml/dish (100x25 mm petri dish) . 

4. Remove any excess water from leaves and place 6-B 2 
inch pieces in each dish of enzyme. 14 plates are usually set 
up with the leaf material from about 100 seedlings. 

5. Cut leaves in longitudinal strips as thin as 

possible (2-5 mm) . 

6. Shake slowly at 25C for 6.5 to 7 hours. Cover plates 

so that incubation takes place in the dark. 

7. Before filtering protoplasts, wash 100 urn sieves 
with 10 ml 0.6 M mannitol. Pipet protoplasts slowly through 
sieves. .Wash plates with 0.6 M mannitol to gather any 
protoplasts left in the dishes . 

8. Pipet filtered liquid carefully into 50 ml sterile 
tubes. Add equal volumes of 0.6 M mannitol to dilute. 

9. Spin for 10 minutes at 1000 rpm/500 g in table-top 

centrifuge (Beckman Model TJ-6) . 

10. Remove enzyme solution and discard. Resuspend 
pellets carefully in 5 ml mannitol. Pool several pellets. Bring 
volume to 50 ml with 0.6 M mannitol and spin. 

11. Resuspend to a known volume (50 ml) and count. 

12. After counting and pelleting, resuspend protoplasts 
at 2 million/ml in resuspending buffer (recipe below) . Allow 
ppts to incubate in the resuspending buffer for at least 30 min 
before transformation. 

Transformation : 
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1-0 tubes (Fisberbrand polystyrene 
1 Aliquot plasmids to tubes (t^ 

„ , l0 . « snap cap culture tufces, ; at least tn.ee replicate, 
per treatment; use e,uimolar amounts of plasmids » «-t e<ual 
gene copy numbers are compared. 

,r,w n «; ml 40% PEG made wxtn 

2. Add 0.5 ml protoplasts and 0.5 nu. 

0.6 M mannitol. 

3. Sha.ce gently to mix and incunate at 2SC for 30 mm. 

4. Add protoplast culture media at 5 min intervals: 

1,2,5 ml 

5. Spin for 10 min at 1000 rpm/500 g. 

6. Remove liquid from pellet and resuspend in 1 ml 

culture media (BMV media) 

7. incubate overnight at 2SC in the dark. 



Enzyme Solution 
0.6 M mannitol 
10 roM MES, pH 5.7 
1 mM CaCL 2 
1 mM MgCl 2 
0.1% BSA 

filter-sterilize 

To this solution, add the following enzymes 
1% cellulase RS, and 0.1% Macerozyme R10 



Wash Buffer: 0.6 M mannitol, 



e -* ♦ 
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Resuspending Buffer: 0.6 M mannitol, 20 mM KC1, 
filter-sterilize 

Culture Media: BMV media recipe from: 

Okuno et al., Phytopathology 67:610-615 (1977). 

0.6 M mannitol 

4 mM MES, pH 5.7 

0.2 mM KH 2 P0 4 

1 mM KN0 3 

1 mM MgS0 4 

10 mM CaCl 2 

IX K3 micronutrients 

filter-sterilize 

ELISA analysis of transformed protoplasts is done one 
day after transformation. ELISA' s are done as previously 
described. The following three experiments are done with maize 
inbred line 211D. Of course, other lines of maize may be used. 
50 ug of plasmid pCIB4419 and equimolar amounts of other 
plasmids are used. Total soluble protein is determined using 
the BioRad protein assay. (Bradford, Anal .Blochem, 72:248 
(1976) . 

Transformation Expe riment : 
Constructs tested: 

1. pCIB4419 (Construct contains synthetic Bt under control of 
CaMV 35S promoter and 35S/PAT and 35S/GUS marker genes) 

2. pCIB4420 (Construct contains synthetic Bt under control of 
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Pith promoter and PAT marker gene) 

3. pCIB4421 (Construct contains synthetic Bt under control of 
PEPC promoter) 

4. pCIB4423 (Construct contains synthetic Bt under control of 
PEPC promoter and PAT marker gene) 

(PBEC:synthetic-cryIA(b) :Pe P C intron:35S + 35S:PAT:35S) 

in the following experiments, 10 or 11 day old 211D 
seedlings are analyzed for production of the Bt CrylA(b) 
protein in the Biorad protein assay: 

Experiment 1 (11 day seedlings) : 



pCIB4419 
pCIB4420 
pCIB4421 



15,000 ± 3,000 ng Bt/mg protein 
280 ± 65 ng Bt/mg protein 
9,000 ± 800 ng Bt/mg protein 



Experiment 2 (10 day seedlings) : 



pCIB4419 
pCIB4420 
pCIB4421 



5,000 ± 270 ng Bt/mg protein 

80 ± 14 ng Bt/mg protein 

1, 600 ± 220 ng Bt/mg protein 



Experiment 3 (11 day seedlings) : 



pCIB4419 



21,500 ± 1,800 ng Bt/mg protein 
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pCIB4420 
pCIB4421 
pCIB4423 



260 ± 50 ng Bt/mg protein 
ll f 900 ± 4,000 ng Bt/mg protein 
7,200 ± 3,400 ng Bt/mg protein 



The above experiments confirm that both the CaMV 35S 
and PEPC promoters express the synthetic Bt CrylA(b) protein at 
very high levels. The pith promoter, while less efficient, is 
also effective for the expression of synthetic CrylA(b) 
protein, 

EXAMPLE 44: STABLE EXPRESSION OF SYNTHETIC BT IK LETTUCE 

The synthetic Bt gene in the Agrobacterium vector 
PCIB4417 is transformed into T.actuca sativa cv. Redprize 
(lettuce) . The transformation procedure used is described in 
Enomoto et al., Pi*nr_ Cell Reports, 9:6-9 (1990). 
Transformation procedure: 

Lettuce seeds are suface sterilized in 5% Glorox for 5 
minutes followed by several washes in sterile distilled water. 
Surface-sterilized seeds are plated on half strength MS media 
(Murashige and Skoog, Phvsiol. Plant. 15:473-497 (1962)). 
Cotyledons of 6-day-old Redprize seedlings, grown under 
illumination of 3,000 Ix 16 hr at 25C, are used as the explants 
for Agrobacterium infection. The base and tip of each 
cotyledon are removed with a scalpel. The explants are soaked 
for 10 minutes in the bacterial solution which have been 
cultured for 48 hours in AB minimal media with the apropriate 
antibiotics at 28C. After blotting excess bacterial solution 
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on sterile filter paper, the explants are plated on MS 
,0.1 mg/1 BA and 0 . 1 mg/1 *AA> for 2 days. Explants are then 
transferred to selective media containing 500 mg/1 
carbenicillin and 50 mg/1 kanamycin. The explants are 
subcultured to fresh media weekly. The growth chamber 
conditions are 16 hour 2,000 lx light at 25C. After 
approximately 4 weeks, an ELISA is done on healtby looking 
callus from each of four plates being subcultured. The ELISA 
procedure is the same as described above for protoplasts; 
soluble protein is again determined by the Biorad assay 



described above 



0 
0 



pCIB3021 (kan control) 
pCIB4417 (plate 1) 
PCIB4417 (Plate 2) 505 ng Bt/mg protein 

pCIB4417 (plate 3> « ng Bt/mg protein 

PCIB4417 (plate 4, 1.200 ng Bt/mg protein 

This example demonstrates that dicot plants can also show 

• « oofimized insect icidal gene, 

increased expression of the optimizeo, i« 

Example 45. Construction of P CIB4429. 

PCIB4429 contains a preferred maize pollen-specific 
promoter fused with the maize optimized crylA(b) gene. The 
pollen-specific maize promoter used in this construct was 
obtained from the plasmid P KL2, described in Example 37. 1 
maize optimized crylA(b) gene was obtained from plasmid 
PCIB4418, also described in Example 37. 

PKL2 is a plasmid that contains a preferred maize 



140 



WO 93/07278 



PCI7US92/08476 



pollen-specific promoter fused with the L coli 
beta-glucuronidase gene. It was constructed from 
pSKHO and P CIB3054 . pSKHO contains the pollen specific maize 
promoter. pCIB3054, a P UC19 derivative, contains the E^. coli 
beta-glucuronidase (GUS) gene fused with the cauliflower mosaic 
virus (CaMV) 35S promoter. It's construction is described 
elsewhere in this application. This promoter can be removed 
from this plasmid by cutting with Sall/Hindlll to yield a 
fragment containing the GUS gene, a bacterial ampicillin 
resistance gene and a ColEI origin of replication. A second 
fragment contains the CaMV 35S promoter. 

PCIB3054 was cut with the restriction enzymes Sail and 
Hindlll, using standard conditions, for 2 hours at room 
temperature. The reaction was then extracted with 
phenol /chloroform using standard conditions and the DNA 
recovered by ethanol precipitation using standard conditions. 
The recovered DNA was resuspended in buffer appropriate for 
reaction with calf intestinal alkaline phosphatase (CIP) and 
reacted with 2.5 units of CIP at 37»C overnight. After the CIP 
reaction, the DNA was purified on an agarose gel using standard 
conditions described elsewhere in this application. pSKllO was 
cut with Sall/Hindlll under standard conditions for 2 hours at 
room temperature and the DNA subsequently purified on an 
agarose gel using standard conditions. The recovered DNA 
fragments were ligated using standard conditions for two hours 
at room temperature and subsequently transformed into competent 
E . coli strain HB101 cells using standard conditions. 
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Transforms were selected on L-agar containing 100 *g 
a^icillin/ml. Transf ormants were characterized for the desired 
plasmid construct using standard plasmid mini-screen 
procedures. The correct construct was named P KL2. 

To make P CIB4429, a three way ligation was performed 
using standard conditions Known to those in the art. The three 

fragments ligated were: 

1) a Hindlll/BamHI fragment from pCIB4418, of about 4.7 

Kb in size, containing the crylA(b) gene, the bacterial 
ampicillin resistance gene, and the ColEI origin of replication 

2) a Hindlll/Xbal fragment from P KL2 of about 1.3 kb xn 
size and containing the pollen specific promoter from maize 

3) a PCR generated fragment derived from the pollen 
promoter with a BamHI site introduced downstream from the start 
of transcription. This fragment is approximately 120 bp and 
has ends cut with the restriction enzymes Xbal/BamHI. 

The PCR fragment was generated using a 100 ul reaction 
volume and standard conditions described above. The primers 



used were: 

SK50: 5'-CCC TTC AAA ATC TAG AAA CCT-3' 

KE127: 5'-GCG GAT CCG GCT GCG GCG GGG AAC GA-3' 

The above primers were mixed in a PCR reaction with 
plasmid PSK105, a plasmid that contains the pollen specific 

promoter from maize. 

After the PCR reaction was complete, 10 ul of the 
reaction was run on an agarose gel, using standard condition, 
to make sure the reaction produced the expected size product. 
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The remaining 90 ul was treated with proteinase K at a final 
concentration of 50 pg/ml for 30 min. at 37-C. The reaction was 
then heated at 65-C for 10 min., then phenol/chloroform 
extracted using standard procedures. The DNA was recovered from 
the supernatant by precipitating with two volumes of ethanol 
using standard conditions. After precipitation, the DNA was 
recovered by centrifuging in a microfuge. The pellet was rinsed 
one time with 70% ethanol (as is standard in the art), briefly 
dried to remove all ethanol, and the pellet resuspended in 17 
Ul TE buffer. 2 ul of 10X restriction enzyme buffer was added 
as were 0.5 ul BamHI and 0.5 ul Xbal . The DNA was digested for 
1 hour at 37-C to produce a DNA fragment cut with Xbal/BamHI. 
After digestion with the restriction enzymes, this fragment was 

on an agarose gel composed of 2% NuSieve (FMC) /1% 
agarose gel. Millipore filter units were used to elute the DNA 
from the agarose using the manufacturer's specifications. After 
elution, the DNA was used in the three-way ligation described 
above. 

After ligation, the DNA was transformed into competent 
Ej. coli strain HB101 cells using standard techniques . 
Transformants were selected on L-agar plates containing 
ampicillin at 100 ug/ml. Colonies that grew under selective 
conditions were characterized for plasroid inserts using 
techniques standard in the art . 

EXAMPLE 46. Construction of pCIB4431, a vector for tissue 
specific expression of the synthetic crylA(b) gene in plants. 

pCIB4431 is a vector designed to transform maize. It 
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contains two cleric Bt endotoxin genes expressive in .size 
These genes are the PEP — s. promoter/synthetic-crylX.b, 
and a poilen promoter/synthetic-cryIMb) . The PEP 
carboxylase/cryDMb, gene in this vector is derive* from 
PCIE442! described above. The poilen promoter is also described 
above. Fig. 20 is a map of piasmid PCIB4431. PCIB4431 was 
constructed via a three part ligation using the about 3.5 Kb 
Kpn I/Hind III fragment (containing 

pollen/synchetic-crylACb.from pcIB4429, the about 4.5 Kb Hind 
HI/Eco RZ ,PBPC/synthetio-=ryIA(b) and the about 2.S Kb Kpn 
I/Eco RI fragment from the vector Bluescript . 

other vectors including the pollen promoter/synthetic 
CryUUb, chimeric gene include PCIB4428 and P«B4430. see Figs. 
21 and 22. PCIB4430 also contains the PEPC/synthetic-Et gene 



EXAMPLE 47. Production of trans' r £ Plants containing 
the synthetic maize optimized CryiA.oj gex 

The example below utilizes Biolistios to introduce DSA 
coated particles into maize, cells, from which transformed 
plants are generated. 
Experiment KC-65 

Production of transgenic maize piants expressing the synthetxc 

eryix(b) gene using a tissue-specific promoter. 

Tissue 

!mmature maize embryos, approximately 1.5-2.5 mm in length, 
were excised from an ear of genotype 6K615 14-15 days after 
pollination. The mother plant was grown in the greenhouse. 
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Before excision, the ear was surface sterilized with 20% Clorox 
for 20 minutes and rinse 3 times with sterile water. 
Individual embryos were plated scutellum side in a 2 cm square 
area, 36 embryos to a plate, on the callus initiation medium, 
2DG4 + 5 chloramben medium (N6 major salts, B5 minor salts, MS 
iron, 2% sucrose, with 5 mg/1 chloramben, 20 mg/1 glucose, and 
10 ml G4 additions (Table 1) added after autoclaving. 

TABLE 1 - G4 Additions 



Inaredient per 


liter 


medium 


Casein hydrolysate 


0 


.5 


gm 


Proline 


1 


.38 


gm 


Nicotinic acid 




.2 


mg 


Pyridoxine-HCl 




,2 


mg 


Thi amine -HC1 




.5 


mg 


Choline-HCl 




.1 


mg 


Riboflavin 




.05 


mg 


Biotin 




• 1 


mg 


Folic acid 




.05 


mg 


Ca pantothenate 




.1 


mg 


p-aminobenzoic acid 




.05 


mg 


B12 




.13< 


6 pg 



Bombardment 

Tissue was bombarded using the PDS-lOOOHe Biolistics 
device. The tissue was placed on the shelf 8 cm below the 
stopping screen shelf. The tissue was shot one time with the 
DNA/gold microcarrier solution, 10 ul dried onto the 
macrocarrier. The stopping screen used was hand punched at 
ABRU using 10x10 stainless steel mesh screen. Rupture discs of 
1550 psi value were used. After bombardment, the embryos were 
cultured in the dark at 25* C. 



Preparation of DNA for delivery 
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The microcarrier was prepared essentially according Co 
the instructions supplied with the Biolistic device. While 
vorcexing 50 ul 1.0 P gold microcarrier, added 5 pi P C IB 4431 

,1 23 pg/Ul) C#898, ♦ 2 Pi PCIB3064 0.895 pg/pl> t#4S« 

followed by 50 ul 2.5 M CaCl,, then 20 pi 0.1 M spermidine 

(free base. IC grade) . The resulting mixture was vortexed 3 

. fnr , n fiec The supernatant was removed 
minutes and microfuged for 10 sec. me su*. 

and the Aerocarriers washed 2 times with 250 ul of 100% EtOH 
(HPLC grade) by vortexing briefly, centrifuging and removing 
the supernatant. The microcarriers are resuspended in 65 ul 
100% EtOH. 



rallus f ormation 

anbryos were transferred to callus initiation medium 
with 3 mg/1 PPT 1 day after bombardment. Embryos were scored 
for callus initiation at 2 and 3 weeks after bombardment . Any 
responses were transferred to callus maintenance medium, 2DG4 + 
0 s 2,4-D medium with 3 mg/L PPT. Callus maintenance medium is 
N6 major salts, B5 minor salts, MS iron, 2% sucrose, with 0.5 
mg /l 2,4-D, 20 mg/1 glucose, and 10 ml G4 additions added after 
autoclaving. Embryogenic callus was subcultured every 2 weeks 
to fresh maintenance medium containing 3 mg/L PPT. All callus 

was incubated in the dark at 25»C. 

The Type I callus formation response was 15%. Every 
embryo which produced callus was cultured as an individual 
event giving rise to an individual line. 
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Regeneration 

After 12 weeks on selection, the tissue was removed 
from callus maintenance edium with PPT and was placed on 
regeneration medium. Regeneration medium is 0.25MS3S5BA (0.25 
mg/1 2,4 D, 5 mg/1 BAP, MS salts, 3% sucrose) for 2 weeks 
followed by subculture to MS3S medium for regeneration of 
plants. After 4 to 10 weeks, plants were removed and put into 
GA 7's. Our line KC65 0-6, which became the #176 BT event, 
produced a total of 38 plants. 



Assays 

All plants, as they became established in the GA7's, 
were tested by the chlorophenol red (CR) test for resistance to 
PPT as described in U.S. Patent Application 07/759,243, filed 
September 13, 1991, the relevant portions of which are hereby 
incorporated herein by reference. This assay utilizes a P H 
sensitive indicator dye to show which cells are growing in the 
presence of PPT. Cells which grow produce a pH change in the 
media and turn the indicator yellow (from red) . Plants 
expressing the resistance gene to PPT are easily seen in this 
test. (#176 - 8 positive/30 negative) Plants positive by the 
CR test were assayed by PCR for the presence of the synthetic 
BT gene. (#176 - 5 positive/2 negative/1 dead) 

Plants positive by PCR for the syn-BT gene were sent to 
the phytotron. Once established in the phytotron, they were 
characterized using insect bioassays and ELISA analysis. 
Plants were insect bioassayed using a standard European Corn 
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Borer assay (described in Example 5A, in which small pieces of 
leaf of clipped from a plant and placed in a small petri dish 
with a number of ECB neonate larvae. Plants are typically 
assayed at a height of about 6 inches. Plants showing 100% 
natality to ECB in this assay are characterized further. ELISA 
data are shown below. Positive plants are moved to the 
greenhouse . 

r.rP.P.nhouse/Fer tility 

Plant number #17 6-11 was pollinated with wild-type 
6K615 pollen. One tassel ear and one ear shoot were produced. 
All of the embryos from the tassel ear (ID and 56 Kernels from 
Ear 1 were rescued. 294 kernels remained on the ear and dried 

down naturally. 

Pollen from #176-11 was outcrossed to various maize 

genotypes 5N984, 5NA89, and 3K961. Embryos have been rescued 
from all 3 outcrosses (5N984 = 45; 5NA89 = 30; 3N961 = 8) . 
Most of the Kernels remained on the ears on the plants in the 
greenhouse and were dried down naturally . 

DNA was isolated from plant #176-11 using standard techniques 
and analysed by Southern blot analysis. It was found to 
contain sequences which hybridize with probes generated from 
the synthetic crylA(b) gene and with a probe generated from the 
PAT gene. These results showed integration of these genes into 
the genome of maize. 



Experiment KC-64 
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r 



Production of transgenic maize plants expressing the synthetic 
crylA(b) gene using a constitutive promoter. 



Tissue 

Immature maize embryos, approximately 1.5-2.5 mm in 

■ 

length, were excised from an ear of genotype 6N615 14-15 days 
after pollination. The mother plant was grown in the 
greenhouse. Before excision, the ear was surface sterilized 
with 20% Clorox for 20 minutes and rinse 3 times with sterile 
water. Individual embryos were plated scutellum side in a 2 cm 
square area, 36 embryos to a plate, on the callus initiation 
medium, 2DG4 + 5 chloramben medium (N6 major salts, B5 minor 
salts, MS iron, 2% sucrose, with 5 mg/1 chloramben, 20 mg/1 
glucose, and 10 ml G4 additions Table 1) added after 
autoc laving. 

TABLE 1 - G4 Additions 

Ingredient per liter medium 

Casein hydrolysate 0.5 gm 

Proline 1.3B gm 

Nicotinic acid .2 mg 

Pyridoxine-HCl .2 mg 

Thiamine-HCl . 5 mg 

Choline-HCl .1 mg 

Riboflavin .05 mg 

Biotin .1 mg 

Folic acid .05 mg 

Ca pantothenate .1 mg 

p-aminobenzoic acid .05 mg 

B12 .136 ug 

Bombardment 

Tissue was bombarded using the PDS-lOOOHe Biolistics 
device. The tissue was placed on the shelf B cm below the 

149 



WO 93/07278 



PCI7US92/08476 



ehplf The tissue was shot one time with the 
stopping screen shelf. Tne ciss 

D»*/gold microoarrier solution, 10 pi dried onto the 
..aerocarrier. The stopping soreen used was hand punohed at 
ABRU using 10*10 stainless steel mesh soreen. Kopture disos or 
15 50 psi value were used. After bombardment, the embryos -ere 
cultured in the dark at 25 • C. 

Preoarati nn of DP for deliver 

The microoarrier was prepared essentially according to 
the instructions supplied with the Biolistio device, while 
,orte*in, SO pi 1.0 U gold Aerocarrier, added 3.2 pi PC1B4410 

X 6 pi p CI B3007A (1.7 P,/pl> (#152) follow* bv 50 Ul ^ " 
Caci,. then 20 pi 0.1 « spermidine (free base. TC grade, . The 
resulting fixture was vortexed 3 minutes and microfuged for 10 
sec The supernatant was removed and the microcarriers washed 
2 times with 250 pi of 100% EtOH (HPLC grade, hy vortexrng 
briefly, centrif uging and removing the supernatant . The 
microcarriers are resuspended in 65 pi 100% EtOH. 

rallus fo rmation 

Eahryos were transferred to callus initiation medrum 
with 3 mg/1 PPT 1 day after bombardment . Embryos were scored 
for callus initiation at 2 and 3 weefcs after bombardment . Any 
responses were transferred to callus maintenance medium, 2DG4 
0 5 2,4-D medium with 3 mg/L PPT. callus maintenance medium i 
HS major salts, B5 minor salts, HS iron, 2% sucrose, with 0.5 
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mg/1 2,4-D, 20 mg/1 glucose, and 10 ml G4 additions added after 
utoclaving. Embryogenic callus was subcultured every 2 weeks 
to fresh maintenance medium containing 3 mg/L PPT. All callus 
was incubated in the dark at 25 # C. 

The Type I callus formation response was 18%. Every 
embryo which produced callus was cultured as an individual 
event giving rise to an individual line. 



Regeneration 

After 12 weeks on selection, the tissue was removed 
from callus maintenance medium with PPT and was placed on 
regeneration medium and incubated at 25 • C using a 16 hour 
light (50 pE .m-2 . s-1) / 8 hour dark photoperiod. 
Regeneration medium is 0.25MS3S5BA (0.25 mg/1 2,4 D, 5 mg/1 
BAP, MS salts, 3% sucrose) for 2 weeks followed by subculture 
to MS3S medium for regeneration of plants. After 4 to 10 
weeks, plants were removed and put into GA 7's. Our line KC64 
0-1, which became the #170 BT event, produced 55 plants. Our 
line KC64 0-7, which became the #171 BT event, produced a total 
of 33 plants. 



Assays 

Eleven plants, as they became established in the GA7's, 
were tested by the chlorophenol red (CR) test for resistance to 
PPT as per Shillito, et al, above. This assay utilizes a pH 
sensitive indicator dye to show which cells are growing in the 
presence of PPT. Cells which grow produce a pH change in the 
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indicator yellow (from red) - Plants 
media and turn the indicator ye a 

expressing the resistance gene to PPT ere easily seen rn thrs 
test. Piants positive by the CR test were assayed by PC* for 
th e presence of the synthetic BT gene. (Event 170 - 37 
positive/18 negative; #171 - 25 positive/8 negative. . 

Plants positive by PC* for the syn-Bt gene were sent to 
the phytotron. Once established in the phytotron, they were 

v^nassavs and ELISA analysis, 
characterized using insect bioassays 

Plants were insect bioassayed using a standard European corn 
borer assay (see below) in which small pieces of leaf of 
clipped from a lant and placed in a small petri dish with a 
number of ECB neonate larvae. Plants are typically assayed at a 
height of about 6 inches. Plants showing 100* mortality to ECB 
in this assay are characterised further. ELISA data are shown 
below. Positive plants are moved to the greenhouse. 

screening 

Eight of the mature plants from the #170 event were 
selected for evaluation of Basta IHoechst, resistance, on one 
middle leaf per plant, an area approximately 10-14 cm long X 
the leaf width was painted with 0, 0.4, 1.0 or 2.0% (10 ml of 
2 00 ,/L diluted to 100 ml with deionised water, agueou, Basta 

. - . ,n/un ml Two plants were tested 
containing 2 drops of Tween 20/100 ml. 

per level. Eight wild-type 6N61S plants of the same 
approximate age were treated as controls. Ml Plants were 
observed at 4 and 7 days . All of the control plants eventually 
died. Throughout the study, none of the .170 plants displayed 
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any damage due to the herbicide. 
Pollination 

All tassel ears, first ear and, if available, the 
second ear on the #170 and #171 plants were pollinated with 
wild-type 6N615 pollen. At least 90% of the plants were female 
fertile. 

Pollen from #171 plants was out crossed to genotypes 
6N615, 5N984, 5NA89, 6F010, 5NA56, 2N217AF, 2ND01 and 3N961. 
At least 90% of the plants were shown to be male fertile. 

Embryo Rescue 

Embryos from the #171 event have been "rescued." 
Fourteen to 16 days after pollination, the ear tip with 25-50 
kernels was cut from the ear with a coping saw. Prior to 
cutting, the husks were gently peeled away to expose the upper 
portion of the ear. The cut end of the ear on the plant was 
painted with Captan fungicide and the husks replaced. The seed 
remaining on the plant was allowed to dry naturally. 

The excised ear piece was surface sterilized with 20% 
Clorox for 20 minutes and rinsed 3 times with sterile water. 
Individual embryos were excised and plated scutellum side up on 
B5 medium [Gamborgl containing 2% sucrose. B5 itamins are 
added to the medium after autoclaving. Four embryos were 
plated per GA7 container and the containers incubated in the 
dark. When germination occurred, the containers were moved to 
a light culture room and incubated at 25- C using a 16 hour 
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^„ „ ? s -i) / 8 hour dark photoperiod. The 
light (50 uE .m-2 . S D / 

germination frequency is 94%. 

Progeny from 15 plants of the #171 event and 2 of the 
#1 7S event were rescued using standard embryo rescue techniques 
and evaluated. All plants were evaluated by insect assay. 
Plants from the #171 event were also tested in the 
his tochemical GUS assay. In both the insect assay and the GUS 
assay, the ratio of segregation of the transgenes was 1:1, a. 
expected for a single locus insertion event. 

EXAMPLE 48. Analysis of transgenic maize plants 

ELISA ASSAY 

Detection of crylA.b) gene expression in transgenic 
^ize is monitored using European corn borer (BCB) insect 
bioassays and ELISA analysis tor a guantitative determination 
of the level of orylA(b) protein obtained. 
Quantitative determination of =ryIA(b, IP in the leaves of 
transgenic plants -as performed using enryme-linked 
i^unosorbant assays (ELISA) as disclosed in Clar* M F. Lister 
R „, Bar-aoseph M= ELISA Technigues . In: w.issbach A, weissbaoh 
H ,eds) Method, i a F.nzvmoloov 118:742-766, Academic Press. 
Florida ,1966, . Immunoaf f inity purified polyclonal rabbxt and 
goat antibodies specific for the B. thurj ngiensis subsp. 
tors Caxi IP were used to determine n, IP per »g soluble protern 
fro » crude extracts of leaf samples. The sensitivity of the 
d ouble sandwich ELISA is 1-5 n, IP Per mg soluble protein usxng 
SO ug of total protein per ELISA microtiter dish well . 
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Corn extracts were made by grinding leaf tissue in 
gauze lined plastic bags using a hand held ball-bearing 
homogenizer (AGDIA, Elkart IN.) in the presence of extraction 
buffer (50 mM Na 2 C0 3 pH 9.5, 100 mM NaCl, 0.05% Triton, 0.05% 
Tween, 1 mM PMSF and 1 pM leupeptin) . Protein determination was 
performed using the Bio-Rad (Richmond, CA) protein assay. 

Using the above procedure, the primary maize 
transformants described above were analyzed for the presence of 
crylA(b) protein using ELISA . These plants varied in height 
from 6 inches to about three feet at the time of analysis. 

Bt ng/mg soluble protein 5/27/91 



Plant 


Bt nc 


176-8 


0 


176-10 


700 


176-li 


760 


17 1-4 A 


59 


171-6 


50 


171-8 


60 


171-9 


280 


171-13 


77 


171-14A 


43 


171-14B 


60 


171-15 


55 


171-16A 


13 


171-16B 


19 


171-18 


19 


176-30 


1160 


171-32 


980 


171-31 


166 


171-30 


370 


71-14 




#10 leaf 


26 


1 leaf 


17 


plant 171-16 


40 


#9 leaf 


#1 leaf 


120 



0 

1585 
2195 
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EUROPEAN CORN BORER ASSAY 



one to four 4 cm sections ere cut fro* en extended lea, c £ a 

corn plant. . a en 

2. Been leaf piece is placed on a listened filter d.sc rn 

X 9 nun petri dish. 

3 Five neonate European corn borer larvae ere placed on each 
leef Piece. (Making a total of 5-20 larvae per plant.) 

4 Toe Petri dishes are incubated at 29.5 -C. 

5 . Leaf feeding damage and mortality data are scored at 2, . . 
and 72 hours. 

EXaMPt* 49. Expression of Bt endotoxin in progeny of 
transformed maize plants 

The transformed maize plants were fully fertile and 
„ere crossed with several genotypes of maize . Progeny from 
Case crosses were analyzed for their ability to .ill European 
corn borer <ECB, in e standard ECB bioass.y .described 

.i for the presence of the crylA(b) 
immediately above) as well as for the pr 

• incx ELISA as described above. The ability to kill ECB 
protein using ELISA as ae»^- .,. 

, ,. nn of crv iA(b) protein correlated. These traits 
and the production of cryiAiDj 

segregated to the progeny with a 1:1 ratio, indicating a srngle 
sit e of insertion for the active copy of the synthetic gene. 
This 1=1 ratio was true for both the constitutive 
promoter/synthetic-crylA*, plants and the tissue specif rc 
promoter/synthetic-crylA.b, plants .data not shown, . 

Pig. 23A is a table containing a small subset of the 
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total number of progeny analyzed. This table is representative 
of a number of different crosses. 

Insect assays were done with Diatrea saccharalis and 
0s trinia nubilalis using leaf material (as described above) of 
transgenic progeny containing a maize optimized CrylA(b) gene. 
The results of these assays are shown in Fig. 23B. They 
demonstrate that the maize optimized CrylA(b) gene functions in 
transformed maize to provide resistance to Sugarcane borer and 
Ostrinia nubilalis . 

EXAMPLE 50. EXPRESSION OF THE CRYIA(b) GENE IN MAIZE POLLEN 

Progeny of the transformed maize plants containing the 
chimeric pollen promoter/synthetic crylA(b) gene derived from 
PCIB4431 were grown in the field to maturity. Pollen was 
collected and analyzed for the presence of the crylA(b) protein 
using standard ELISA techniquesd as described elsewhere. High 
levels of crylA(b) protein were detected in the pollen. 
Progeny from the 35S promoter/ synthetic crylA(b) transformed 
plant were grown in the greenhouse. Pollen from these plants 
was analyzed using ELISA, and crylA(b) protein was detected. 
Results are shown below in Figure 23C. 
It is recognized that factors including selection of 
plant lines, plant genotypes, synthetic sequences and the like, 
may also affect expression. 

EXAMPLE 51. EXPRESSION OF THE CRYIA(b) GENE FUSED TO A 
PITH-PREFERRED PROMOTER. 

PCIB4433 <Fig. 36) is a plasmid containing the maize 
optimized CrylA(b) gene fused with the pith-preferred promoter 
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This olasmid was constructed using a 
isolated from maize. This pxasnnu. 

three-way ligation consisting of: 

1) PCIB4418, cut with BstEII and BamHI; 1 - 8 Kb fragment 

2) pBtinl, cut with Nsil and BstEII ; 5.9 Kb fragment; 
pBtinl is described elsewhere in this application 

3) PGR fragment VI-151 was generated in a PGR reaction 
using standard conditions as described elsewhere in this 
application. 

PCR primers utilized were: 

KE150A28: 5' -ATT CGC ATG CAT GTT TCA TTA TC-3' 

KE151A28: 5' - GCT GGT AGC AGG GAT CGG TGG GTT CTG TGC AAG AAG 



C-3 



After the PCR reaction, the DNA was checked on an 
agarose gel to malce sure the reaction had proceeded properly. 
DNA was recovered from the PCR reaction using standard 
conditions described elsewhere and subsequently cut with the 
restriction enzymes Nsil and BamHI using standard condition, 
^ter cutting, the fragment was run on a 2% NuSieve gel and the 
desired band recovered as described elsewhere. The DNA was 
used in the ligation described above. 

After ligation (under standard condition) , the DNA was 

transformed into competent L. £Oli cell . 

Transformation was carried out using micropro jectile 
bombardment essentially as described elsewhere in this 
application. Embryos were transferred to medium containing 
lOaug/ml »I 24 hours after micropro jectile bombardment. 
Resulting callus was transferred to medium containing 40 pg/ml 
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PPT after four weeks. Plants were regenerated without 
selection . 

A small sample of plants (3-5) was assayed by PCR for 
each event. Further codes were added to indicate different 
positions and distances of embryos with respect to the 
microprojectile bombardment device. Plants were sent to the 
greenhouse having the following codes: 

J521A TOP Plants B^t. PCR Positive 

JS21A MID Plants B.t . PCR Positive 

JS21C BOT Plants B.t . PCR Positive 

JS22D MID Plants B.t. PCR Positive 

JS23B MID Plants B.t. PCR Negative (for control) 

Leaf samples from the regenerated plants were 
bioassayed for insecticidal activity against European corn 
borer as described in Example 4 8 with the results shown in Fig. 
23D. 

ELISA analysis of leaf samples to quantify the level of 
CrylA(b) protein expressed in the leaves was carried out as 
described in Example 4 8 with the results shown in Fig. 24E. 

Deposits 

The following plasraids have been deposited with the 
Agricultural Research Culture Collection (NRRL) (1818 N. 
University St., Peoria, IL 61604) under the provisions of the 
Budapest Treaty: pCIB4418, pCIB4420, pCIB4429, pCIB4431, 
pCIB4433, pCIB5601, pCIB3166 and pCIB3171. 
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The present invention has been described with reference 
„ specific embodiments thereof; however it will be appreciated 
that numerous variations, modifications, and embodiments are 
possible. According, all such variations, modifications and 
embodiments are to be regarded as being within the spirit and 
scope of the present invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Evola, Stephen V. 

Crossland, Lyle D. 
Wright , Martha S. 
Merlin, Ellis J. 
Launis, Karen L. 
Rothstein, Steven J. 

(ii) TITLE OF INVENTION: SYNTHETIC DNA SEQUENCE HAVING ENHANCED 
INSECTICIDAL ACTIVITY IN MAIZE 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CIBA-GEIGY Corporation 

(B) STREET: 7 Skyline Drive 

(C) CITY: Hawthorne 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP : 10532 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1,25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCE / DOCKET NUMBER: CGC 1577/CIP 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (919)541-8615 

(B) TELEFAX : (919)541-8689 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MCKLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

5£JSS£*i«iU» thuringiensis 
S SS5J*iS&J!"Ul*tM insscticida! crysta! 
protein gene 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GTTAACACCC T GGGTCAAAA ATTGATATTT AGTAAAATTA GTTGCACTTT GTGCATTTTT 
TCATAAGAT G AGT CAT ATGT TTTAAATTGT AGTAATGAAA AACAGTATTA TATCATAATG 
AATTGGTATC T T AAT AAAAG AGATGGAGGT AACTTATGGA TAACAATCCG AACATCAATG 
AATGCATTCC TTATAATTGT TTAAGTAACC CTGAAGTAGA AGTATTAGGT GGAGAAAGAA 
TAGAAACTGG TTACACCCCA ATCGATATTT CCTTGTCGCT AACGCAATTT CTTTTGAGTG 
AATTTGTTCC CGGTGCTGGA TTTGTGTTAG GACTAGTTGA TATAATATGG GGAATTTTTG 
GTCCCTCTCA ATGGGACGCA TTTCTTGTAC AAATTGAACA GTTAATTAAC CAAAGAATAG 
AAGAATTCGC TAGGAACCAA GCCATTTCTA GATTAGAAGG ACTAAGCAAT CTTTATCAAA 
TTTACGCAGA ATCTTTTAGA GAGT GGGAAG CAGATCCTAC TAATCCAGCA TTAAGAGAAG 
AGATGCGTAT TCAATTCAAT GACATGAACA GTGCCCTTAC AACCGCTATT CCTCTTTTTG 
CAGTTCAAAA TTATCAAGTT CCTCTTTTAT CAGTATATGT TCAAGCTGCA AATTTACATT 
TATCAGTTTT GAGAGATGTT TCAGTGTTTG GACAAAGGTG GGGATTTGAT GCCGCGACTA 
TCAATAGTCG TTATAATGAT TTAACTAGGC TTATTGGCAA CTATACAGAT CATGCTGTAC 
GCTGGTACAA TACGGGATTA GAGCGTGTAT GGGGACCGGA TTCTAGAGAT TGGATAAGAT 
ATAAT CAATT TAGAAGAGAA TTAACACTAA CTGTATTAGA TATCGTTTCT CTATTTCCGA 
ACTATGATAG TAGAACGTAT CCAATTCGAA CAGTTTCCCA ATTAACAAGA GAAATTTATA 
CAAACCCAGT AT T AGAAAAT TTTGATGGTA GTTTTCGAGG CTCGGCTCAG GGCATAGAAG 
GAAGTATTAG GAGTCCACAT TTGATGGATA TACTTAACAG TATAACCATC TATACGGATG 
CTCATAGAGG AGAATATTAT TGGTCAGGGC ATCAAATAAT GGCTTCTCCT GTAGGGTTTT 
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CGGGGCCAGA ATTCACTTTT CCGCTATATG GAACTATGGG AAATGCAGCT CCACAACAAC 120 0 

GTATTGTTGC TCAACTAGGT CAGGGCGTGT AT AGAACAT T ATCGTCCACT TTATATAGAA 1260 

GACCTTTTAA TATAGGGATA AATAATCAAC AACTATCTGT TCTTGACGGG ACAGAATTTG 1320 

* CTTATGGAAC CTCCTCAAAT TTGCCATCCG CTGTATACAG AAAAAGC G G A ACGGTAGATT 1380 

• CGCTGGATGA AATACCGCCA CAGAATAACA ACGTGCCACC TAGGCAAGGA TTTAGTCATC 1440 
GATTAAGCCA TGTTTCAATG TTTCGTTCAG GCTTTAGTAA TAGTAGTGTA AGTATAATAA 1500 
GAGCTCCTAT GTTCTCTTGG ATACATCGTA GTGCTGAATT TAATAATATA ATTCCTTCAT 1560 
CACAAATTAC ACAAATACCT TTAACAAAAT CTACTAATCT TGGCTCTGGA ACTTCTGTCG 1620 
TTAAAGGACC AGGATTTACA GGAGGAGATA TTCTTCGAAG AACTTCACCT GGCCAGATTT 1680 
CAACCTTAAG AGTAAATATT ACTGCACCAT TATCACAAAG ATATCGGGTA AGAATTCGCT 1740 
ACGCTTCTAC C ACAAAT TTA CAATTCCATA CAT CAAT TGA CGGAAGACCT ATTAATCAGG 1800 
GGAATTTTTC AGCAACTATG AGTAGTGGGA GTAATT TACA GTCCGGAAGC TTTAGGACTG I860 
TAGGTTTTAC TACTCCGTTT AACTTTTCAA ATGGATCAAG TGTATTTACG TTAAGTGCTC 1920 
ATGTCTTCAA TTCAGGCAAT GAAGTTTATA TAGATCGAAT TGAATTTGTT CCGGCAGAAG 1980 
TAACCTTTGA GGCAGAATAT GATTTAGAAA GAGCACAAAA GGCGGTGAAT GAGCTGTTTA 2040 
CTTCTTCCAA TCAAATCGGG T T AAAAAC AG ATGTGACGGA TTATCATATT GATCAAGTAT 2100 
CCAATTTAGT TGAGTGTTTA TCTGATGAAT TTTGTCTGGA TGAAAAAAAA GAATTGTCCG 2160 
AGAAAGTCAA ACATGCGAAG CGACTTAGTG ATGAGCGGAA TTTACTTCAA GATCCAAACT 2220 
TTAGAGGGAT CAATAGACAA CTAGACCGTG GCTGGAGAGG AAGTACGGAT ATTACCATCC 2280 
AAGGAGGCGA TGACGTATTC AAAGAGAATT ACGTTACGCT ATTGGGTACC TTTGATGAGT 234 0 
GCTATCCAAC GTATTTATAT CAAAAAATAG ATGAGTCGAA ATTAAAAGCC TATACCCGTT 2400 
ACCAATTAAG AGGGTATATC GAAGATAGTC AAGACTTAGA AATCTATTTA ATTCGCTACA 2460 
ATGCCAAACA CGAAACAGTA AATGTGCCAG GTACGGGTTC CTTATGGCCG CTTTCAGCCC 2520 

; CAAGTCCAAT CGGAAAATGT GCCCATCATT CCCATCATTT CTCCTTGGAC ATTGATGTTG 2580 

GATGTACAGA CTTAAATGAG GACTTAGGTG TATGGGTGAT ATTCAAGATT AAGACGCAAG 2640 

" ATGGCCATGC AAGACTAGGA AATCTAGAAT TTCTCGAAGA GAAACCATTA GTAGGAGAAG 2700 
CACTAGCTCG TGTGAAAAGA GCGGAGAAAA AATGGAGAGA CAAACGTGAA AAATTGGAAT 
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GGGAAACAAA TATTGTTTAT 
CTCAATATGA TAGATTACAA 
GCGTTCATAG CATTCGAGAA 
CGGCTATTTT TGAAGAATTA 
GAAATGTCAT TAAAAATGGT 
ATGTAGATGT AGAAGAACAA 
CAGAAGTGTC ACAAGAAGTT 
CGTACAAGGA GGGATATGGA 
ACGAACTGAA GTTTAGCAAC 
GTAATGATTA TACTGCGACT 
GATATGACGG AGCCTATGAA 
AAGAAAAAGC ATATACAGAT 
GGGATTACAC ACCACTACCA 
CCGATAAGGT ATGGATTGAG 
AATTACTTCT TATGGAGGAA 
GAT T ACTGAC TTGTATTGAC 
TCACTCTTAA AAGAAT GATG 
TTTTTTGCGA AGGCTTTACT 
CACTACCCCC AAGTGTCAAA 
ATTTTTTATG AATCTTTCAA 
T CAT TTAACC CCTTCTCTTT 
ACGAAAGTTT TCAGGAAATG 
GAGT GATTCT CTCGTTCGAC 
CCAGAAG GAC T CAAT AAACG 
TCTGCATTAT GGAAAAGTAA 
TATTTTCAAC GAATCCGTAT 
CATGTATATC CTGGGTCAGG 



AAAGAGGCAA AAGAATCTGT 
GCGGATACCA ACATCGCGAT 
GCTTATCTGC CTGAGCTGTC 
GAAGGGCGTA TTTTCACTGC 
GATTTTAATA ATGGCTTATC 
AACAACCACC GTTCGGTCCT 
CGTGTCTGTC CGGGTCGTGG 
GAAGGTTGCG TAACCATTCA 
TGTGTAGAAG AGGAAGTATA 
CAAGAAGAAT ATGAGGGTAC 
AGCAATTCTT CTGTACCAGC 
GGACGAAGAG ACAATCCTTG 
GCTGGCTATG TGACAAAAGA 
AT CGGAGAAA CGGAAGGAAC 
TAATATATGC TTTATAATGT 
AGATAAATAA GGAAATTTTT 
TCCGTTTTTT GTATGATTTA 
TAACGGGGTA CCGCCACATG 
AAACGT T ATT CTTTCTAAAA 
TTCAAGATGA ATTACAACTA 
TGGAAGAACT CGCTAAAGAA 
AATTAGCTAC CATATGTATC 
TATGCAGTCA ATT ACACGC C 
CTTTGATAAA AAAGCGGTTG 
ACTTTGTAAA ACAT CAGCCA 
TTTAGATGCG ACGATTTTCC 
TGGTTGTGCA CAAACTGCAG 



AGATGCTTTA 
GATTCATGCG 
TGTGATTCCG 
ATTCTCCCTA 
CTGCTGGAAC 
TGTTGTTCCG 
CTATATCCTT 
TGAGATCGAG 
TCCAAACAAC 
GTACACTTCT 
TGATTATGCA 
TGAATCTAAC 
AT T AGAGT AC 
ATTCATCGTG 
AAGGTGTGCA 
AT AT GAAT AA 
ACGAGTGATA 
CCCATCAACT 
AGCTAGCTAG 
TTTTCTGAAG 
TTAGGTTTTG 
TGGGGCAGTC 
GCCACAGCAC 
AATTTTTGAA 
TTTCAAGTGC 
AAGTACCGAA 



TTTGTAAACT 
GCAGATAAAC 
GGTGTCAATG 
TATGATGCGA 
GTGAAAGGGC 
GAATGGGAAG 
CGTGTCACAG 
AACAATACAG 
ACGGTAACGT 
CGTAATCGAG 
TCAGCCTATG 
AGAGGATATG 
TTCCCAGAAA 
GACAGCGTGG 
AATAAAGAAT 
AAAACGGGCA 
TTTAAATGTT 
TAAGAATTTG 
AAAGGAT GAC 
AGCTGTATCG 
TAAAAAGAAA 
AACGTACAGC 
TCTTATGAGT 
ATATATTTTT 
AGCACTCACG 
ACATTTAGCA 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pure maize optimized synthetic BT cryiAio; 

gene 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GG TGCAGATC 
GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GAGGGCCTGA GCAACCTGTA CCAGATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT T CAACGAC AT GAACAGCGCC 
CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTGAGCGT GTTCGGCCAG 
CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGC 
CCCGACAGCC GCGACTGGAT CCGCTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
AGCCAGCTGA CCCGCGAGAT CTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
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ATCATGGCCA GCCCCGTGGG 
ATGGGCAACG CCGCCCCCCA 
ACCCT GAGCA GCACCCTGTA 
AGCGTGCTGG ACGGCACCGA 
TACCGCAAGA GCGGCACCGT 
CCCCCCCGCC AGGGCTTCAG 
AGCAACAGCA GCGTGAGCAT 
GAGT TCAAC A ACATCATCCC 
AACCTGGGCA GCGGCACCAG 
CGCCGCACCA GCCCCGGCCA 
CAGCGCTACC GCGTGCGCAT 
ATCGACGGCC GCCCCATCAA 
CTGCAGAGCG GCAGCTTCCG 
AGCAGCGTGT TCACCCTGAG 
CGCATCGAGT TCGTGCCCGC 
CAGAAGGCCG TGAACGAGCT 
ACCGACTACC ACATCGACCA 
CTGGACGAGA AGAAGGAGCT 
CGCAACCTGC TGCAGGACCC 
CGCGGCAGCA CCGACATCAC 
ACCCTGCTGG GCACCTTCGA 
AGCAAGCTGA AGGCCTACAC 
CTGGAGATCT ACCTGATCCG 
GGCAGCCTGT GGCCCCTGAG 
CACTTCAGCC TGGACATCGA 
GTGATCTTCA AGATCAAGAC 
GAGGAGAAGC CCCTGGTGGG 



CTTCAGCGGC 
GCAGCGCATC 
CCGCCGCCCC 
GTTCGCCTAC 
GGACAGCCTG 
CCACCGCCTG 
CATCCGCGCC 
CAGCAGCCAG 
CGTGGTGAAG 
GATCAGCACC 
CCGCTACGCC 
CCAGGGCAAC 
CACCGTGGGC 
CGCCCACGTG 
CGAGGTGACC 
GTTCACCAGC 
GGTGAGCAAC 
GAGCGAGAAG 
CAACTTCCGC 
CATCCAGGGC 
CGAGTGCTAC 
CCGCTACCAG 
CTACAACGCC 
CGCCCCCAGC 
CGTGGGCTGC 
CCAGGACGGC 
CGAGGCCCTG 



CCCGAGTTCA 
GTGGCCCAGC 
TTCAACATCG 
GGCACCAGCA 
GACGAGATCC 
AGCCACGTGA 
CCCATGTTCA 
ATCACCCAGA 
GGCCCCGGCT 
CTGCGCGTGA 
AGCACCACCA 
TTCAGCGCCA 
TTCACCACCC 
TTCAACAGCG 
TTCGAGGCCG 
AG CAACCAG A 
CTGGTGGAGT 
GTGAAGCACG 
GGCAT CAACC 
GGCGACGACG 
CCCACCTACC 
CTGCGCGGCT 
AAG CACGAGA 
CCCATCGGCA 
ACCGACCTGA 
CACGCCCGCC 
GCCCGCGTGA 



CCTTCCCCCT 
TGGGCCAGGG 
GC AT CAACAA 
GCAACCTGCC 
CCCCCCAGAA 
GCATGTTCCG 
GCTGGATCCA 
TCCCCCTGAC 
TCACCGGCGG 
ACATCACCGC 
ACCTGCAGTT 
CCATGAGCAG 
CCTTCAACTT 
GCAACGAGGT 
AGTACGACCT 
TCGGCCTGAA 
GCCTGAGCGA 
CCAAGCGCCT 
GCCAGCTGGA 
TGTTCAAGGA 

TGT AC CAGAA 
ACAT CGAGGA 
CCGTGAACGT , 
AGTGCGCCCA 
ACGAGGACCT 
TGGGCAACCT 
AGCGCGCCGA 



GTACGGCACC 
CGTGTACCGC 
CCAGCAGCTG 
CAGCGCCGTG 
CAACAACGTG 
CAGCGGCTTC 
CCGCAGCGCC 
CAAGAGCACC 
CGACATCCTG 
CCCCCTGAGC 
CCACACCAGC 
CGGCAGCAAC 
CAGCAACGGC 
GTACATCGAC 
GGAGCGCGCC 
GACCGACGTG 
CGAGTTCTGC 
GAGCGACGAG 
CCGCGGCTGG 
GAACTACGTG 
GATCGACGAG 
CAGCCAGGAC 
GCCCGGCACC 
CCACAGCCAC 
GGGCGTGTGG 
GGAGTTCCTG 
GAAGAAGTGG 
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CGCGACAAGC GCGAGAAGCT GGAGTGGGAG ACCAACATCG TGTACAAGGA GGCCAAGGAG 
AGCGTGGACG CCCTGTTCGT GAACAGCCAG TACGACCGCC TGCAGGCCGA CACCAACATC 
GCCATGATCC ACGCCGCCGA CAAGCGCGTG CACAGCATCC GCGAGGCCTA CCTGCCCGAG 
CTGAGCGTGA TCCCCGGCGT GAACGCCGCC ATCTTCGAGG AGCTGGAGGG CCGCATCTTC 
ACCGCCTTCA GCCTGTACGA CGCCCGCAAC GTGATCAAGA ACGGCGACTT CAACAACGGC 
CTGAGCTGCT GGAACGTGAA GGGCCACGTG GACGTGGAGG AGCAGAACAA CCACCGCAGC 
GTGCTGGTGG TGCCCGAGTG GGAGGCCGAG GTGAGCCAGG AGGTGCGCGT GTGCCCCGGC 
CGCGGCTACA TCCTGCGCGT GACCGCCTAC AAGGAGGGCT ACGGCGAGGG CTGCGTGACC 
ATC C AC GAGA TCGAGAACAA CACCGACGAG CTGAAGTTCA GCAACTGCGT G GAG GAG GAG 
GTGTACCCCA AC AAC AC C G T GACCTGCAAC GACTACACCG CCACCCAGGA GGAG TACG AG 
GGCACCTACA CCAGCCGCAA CCGCGGCTAC GACGGCGCCT ACGAGAGCAA CAGCAGCGTG 
CCCGCCGACT ACGCCAGCGC CTACGAGGAG AAGGCCTACA CCGACGGCCG CCGCGACAAC 
CCCTGCGAGA GCAACCGCGG CTACGGCGAC TACACCCCCC TGCCCGCCGG CTACGTGACC 
AAGGAGCTGG AGTACTTCCC CGAGACCGAC AAGGTGTGGA TCGAGATCGG CGAGACCGAG 
GGCACCTTCA TCGTGGACAG CGTGGAGCTG CTGCTGATGG AGGAGTAGTA CATG 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Truncated synthetic maize optimized bt 

CrylA(b) gene 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GATCCAACAA TGGACAACAA CCCCAACATC AACGAGTGCA TCCCCTACAA CTGCCTGAGC 
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AACCCCGAGG 
ATCAGCCTGA 
CTGGGCCTGG 
GTGCAGATCG 
AGCCGCCTGG 
GAGGCCGACC 
AACAGCGCCC 
CTGAGCGTGT 
TTCGGCCAGC 
CGCCTGATCG 
GTGTGGGGTC 
CTGACCGTGC 
CGCACCGTGA 
GGCAGCTTCC 
GACATCCTGA 
GGCCACCAGA 
TACGGCACCA 
GTGTACCGCA 
CAGCAGCTGA 
AGCGCCGTGT 
AACAACGTGC 
AGTGGCTTCA 
CGCAGTGCCG 
AAGAGCACCA 
GACATCCTGC 
CCCCTGAGCC 
CACACCAGCA 



TGGAGGTGCT 
GCCTGACCCA 
TGGACATCAT 
AGCAGCT GAT 
AGGGCCTGAG 
CCACCAACCC 
TGACCACCGC 
ACGTGCAGGC 
GCTGGGGCTT 
GCAACTACAC 
CCGACAGCCG 
TGGACATCGT 
GCCAGCTGAC 
GCGGCAGCGC 
ACAGCATCAC 
TCATGGCCAG 
TGGGCAACGC 
CCCTGAGCAG 
GCGTGCTGGA 
ACCGCAAGAG 
CACCTCGACA 
GCAACAGCAG 
AGTTCAACAA 
ACCTGGGCAG 
GCCGCACCAG 
AGCGCTACCG 
TCGACGGCCG 



GGGCGGCGAG 
GTTCCTGCTG 
CTGGGGCATC 
CAACCAGCGC 
CAACCTGTAC 
CGCCCTGCGC 
CATCCCCCTG 
CGCCAACCTG 
CGACGCCGCC 
CGACCACGCC 
CGACTGGATC 
GAGCCTGTTC 
CCGCGAGATT 
CCAGGGCATC 
CATCTACACC 
CCCCGTCGGC 
TGCACCTCAG 
CACCCTGTAC 
CGGCACCGAG 
CGGCACCGTG 
GGGCTTCAGC 
CGTGAGCATC 
CATCATCCCC 
CGGCACCAGC 
CCCCGGCCAG 
CGTCCGCATC 
CCCCATCAAC 



CGCATCGAGA 
AGCGAGTTCG 
TTCGGCCCCA 
ATCGAGGAGT 
CAAATCTACG 
GAGGAGATGC 
TTCGCCGTGC 
CACCTGAGCG 
ACCATCAACA 
GTGCGCTGGT 
AGGTACAACC 
CCCAACTACG 
TACACCAACC 
GAGGGCAGCA 
GACGCCCACC 
TTCAGCGGCC 
CAGCGCATCG 
CGTCGACCTT 
TTCGCCTACG 
GACAGCCTGG 
CACCGTCTGA 
ATCCGTGCAC 
AGCAGCCAGA 
GTGGTGAAGG 
ATCAGCACCC 
CGCTACGCCA 
CAGGGCAACT 



CCGGCTACAC 
TGCCCGGCGC 
GCCAGTGGGA 
TCGCCCGCAA 
CCGAGAGCTT 
GCATCCAGTT 
AGAACTACCA 
TGCTGCGCGA 
GCCGCTACAA 
ACAACACCGG 
AGTTCCGCCG 
ACAGCCGCAC 
CCGTGCTGGA 
TCCGCAGCCC 
GCGGCGAGTA 
CCGAGTTCAC 
TGGCACAGCT 
TCAACATCGG 
GCACCAGCAG 
ACGAGATCCC 
GCCACGTGAG 
CTATGTTCAG 
TCACCCAGAT 
GCCCCGGCTT 
TGCGCGTGAA 
GCACCACCAA 
TCAGCGCCAC 



CCCCATCGAC 
CGGCTTCGTG 
CGCCTTCCTG 
CCAGGCCATC 
CCGCGAGTGG 
CAACGACATG 
GGTGCCCCTG 
CGTCAGCGTG 
CGACCTGACC 
CCTGGAGCGC 
CGAGCTGACC 
CTACCCCATC 
GAACTTCGAC 
CCACCTGATG 
CTACTGGAGC 
CTTCCCCCTG 
GGGCCAGGGA 
CAT CAACAAC 
CAACCTGCCC 
CCCTCAGAAC 
CATGTTCCGC 
CTGGATTCAC 
CCCCCTGACC 
CACCGGCGGC 
CATCACCGCC 
CCTGCAGTTC 
CATGAGCAGC 
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GGCAGCAACC TGCAGAGCGG CAGCTTCCGC ACCGTGGGCT TCACCACCCC CTTCAACTTC 
AGCAACGGCA GCAGCGTGTT CACCCTGAGC GCCCACGTGT TCAACAGCGG CAACGAGGTG 
TACATCGACC GCATCGAGTT CGTGCCCGCC GAGGTGACCT TCGAGGCCGA GTACGACCTG 
GAGAGGGCTC AGAAGGCCGT GAACGAGCTG TTCACCAGCA GCAACCAGAT CGGCCTGAAG 
ACCGACGTGA CCGACTACCA CATCGATCAG GTGTAGGAGC T 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: . . ._ oH « T 

(A) ORGANISM : Full length synthetic maize optimized BT 

CrylA(b) gene 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
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1800 
1860 
1920 
1961 



GATCCAACAA 


TGGACAACAA 


GCCCAACATC 


AACGAGTGCA 


TCCCCTACAA 


CTGCCTGAGC 


60 


AACCCCGAGG 


TGGAGGTGCT 


GGGCGGCGAG 


CGCATCGAGA 


CCGGCTACAC 


CCCCATCGAC 


120 


ATCAGCCTGA 


GCCTGACCCA 


GTTCCTGCTG 


AGCGAGTTCG 


TGCCCGGCGC 


CGGCTTCGTG 


1B0 


CTGGGCCTGG 


TGGACATCAT 


CTGGGGCATC 


TTCGGCCCCA 


GCCAGTGGGA CGCCTTCCTG 


240 


GTGCAGATCG 


AGCAGCTGAT 


CAACCAGCGC 


ATCGAGGAGT 


TCGCCCGCAA 


CCAGGCCATC 


300 


AGCCGCCTGG 


AGGGCCTGAG 


CAACCTGTAC 


CAAATCTACG 


CCGAGAGCTT 


CCGCGAGTGG 


360 


GAGGCCGACC 


CCACCAACCC 


CGCCCTGCGC 


GAGGAGATGC 


GCATCGAGTT 


CAACGACATG 


420 


AACAGCGCCC 


TGACCACCGC 


CATCCCCCTG 


TTCGCCGTGC 


AGAACTACCA 


GGTGCCCCTG 


480 


CTGAGCGTGT 


ACGTGCAGGC 


CGCCAACCTG 


CACCTGAGCG 


TGCTGCGCGA 


CGTCAGCGTG 


540 


TTCGGCCAGC 


GCTGGGGCTT 


CGACGCCGCC 


ACCATCAACA 


GCCGCTACAA 


CGACCTGACC 


600 


CGCCTGATCG 


GCAACTACAC 


CGACCACGCC 


GTGCGCTGGT 


ACAACACCGG 


CCTGGAGCGC 


660 


GTGTGGGGTC 


CCGACAGCCG 


CGACTGGATC 


AGGTACAACC 


AGTTCCGCCG 


CGAGCTGACC 


720 
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CTGACCGTGC TGGACATCGT 
CGCACCGTGA GCCAGCTGAC 
GGCAGCTTCC GCGGCAGCGC 
GACATCCTGA ACAGCATCAC 
GGCCACCAGA TCATGGCCAG 
TACGGCACCA TGGGCAACGC 
GTGTACCGCA CCCTGAGCAG 
CAGCAGCTGA GCGTGCTGGA 
AGCGCCGTGT ACCGCAAGAG 
AACAACGTGC CACCTCGACA 
AGTGGCTTCA GCAACAGCAG 
CGCAGTGCCG AGTTCAACAA 
AAGAGCACCA ACCTGGGCAG 
GACATCCTGC GCCGCACCAG 

■ 

CCCCTGAGCC AGCGCTACCG 
CACACCAGCA TCGACGGCCG 
GGCAGCAACC TGCAGAGCGG 
AGCAACGGCA GCAGCGTGTT 
TACATCGACC GCATCGAGTT 
GAGAGGGCTC AGAAGGCCGT 
ACCGACGTGA CCGACTACCA 
GAGTTCTGCC TGGACGAGAA 
AGCGACGAGC GCAACCTGCT 
CGCGGCTGGC GCGGCAGCAC 
AACTACGTGA CCCTGCTGGG 
ATCGACGAGA GCAAGCTGAA 
AGCCAGGACC TGGAGATCTA 



GAGCCTGTTC CCCAACTACG ACAGCCGCAC 
CCGCGAGATT TACACCAACC CCGTGCTGGA 
CCAGGGCATC GAGGGCAGCA TCCGCAGCCC 
CATCTACACC GACGCCCACC GCGGCGAGTA 
CCCCGTCGGC TTCAGCGGCC CCGAGTTCAC 
TGCACCTCAG CAGCGCATCG TGGCACAGCT 
CACCCTGTAC CGTCGACCTT TCAACATCGG 
CGGCACCGAG TTCGCCTACG GCACCAGCAG 
CGGCACCGTG GACAGCCTGG ACGAGATCCC 
GGGCTTCAGC CACCGTCTGA GCCACGTGAG 
CGTGAGCATC ATCCGTGCAC CTATGTTCAG 
CATCATCCCC AGCAGCCAGA TCACCCAGAT 
CGGCACCAGC GTGGTGAAGG GCCCCGGCTT 
CCCCGGCCAG ATCAGCACCC TGCGCGTGAA 
CGTCCGCATC CGCTACGCCA GCACCACCAA 
CCCCATCAAC CAGGGCAACT TCAGCGCCAC 
CAGCTTCCGC ACCGTGGGCT TCACCACCCC 
CACCCTGAGC GCCCACGTGT TCAACAGCGG 
CGTGCCCGCC GAGGTGACCT TCGAGGCCGA 
GAACGAGCTG TTCACCAGCA GCAACCAGAT 
CATCGATCAG GTGAGCAACC TGGTGGAGTG 
GAAGGAGCTG AGCGAGAAGG TGAAGCACGC 
GCAGGACCCC AACTTCCGCG GCAT CAACCG 
CGACATCACC ATCCAGGGCG GCGACGACGT 
CACCTTCGAC GAGTGCTACC CCACCTACCT 
GGCCTACACC CGCTACCAGC TGCGCGGCTA 
CCTGATCCGC TACAACGCCA AGCACGAGAC 



CTACCCCATC 
GAACTTCGAC 
CCACCTGATG 
CTACTGGAGC 
CTTCCCCCTG 
GGGCCAGGGA 
CATCAACAAC 
CAACCTGCCC 
CCCTCAGAAC 
CATGTTCCGC 
CTGGATTCAC 
CCCCCTGACC 
CACCGGCGGC 
CATCACCGCC 
CCTGCAGTTC 
CATGAGCAGC 
CTTCAACTTC 
CAACGAGGTG 
GTACGACCTG 
CGGCCTGAAG 
CCTGAGCGAC 
CAAGCGCCTG 
CCAGCTGGAC 
GTTCAAGGAG 
GTACCAGAAG 
CATCGAGGAC 
CGTGAACGTG 
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CCCGGCACCG GCAGCCTGTG GCCCCTGAGC GCCCCCAGCC CCATCGGCAA GTGCGCCCAC 
CACAGCCACC ACTTCAGCCT GGACAT CGAC GTGGGCTGCA CCGACCTGAA CGAGGACCTG 
GGCGTGTGGG TGATCTTCAA GATCAAGACC CAGGACGGCC ACGCCCGCCT GGGCAACCTG 
GAGTTCCTGG AGGAGAAGCC CCTGGTGGGC GAGGCCCTGG CCCGCGTGAA GCGCGCCGAG 
AAGAAGTGGC GCGACAAGCG CGAGAAGCTG GAGTGGGAGA CCAACATCGT GTACAAGGAG 
GCCAAGGAGA GCGTGGACGC CCTGTTCGTG AACAGCCAGT ACGACCGCCT GCAGGCCGAC 
ACCAACATCG CCATGATCCA CGCCGCCGAC AAGCGCGTGC ACAGCATTCG CGAGGCCTAC 
CTGCCCGAGC TGAGCGTGAT CCCCGGCGTG AACGCCGCCA TCTTCGAGGA GCTGGAGGGC 
CGCATCTTCA CCGCCTTCAG CCTGTACGAC GCCCGCAACG TGATCAAGAA CGGCGACTTC 
AACAACGGCC TGAGCTGCTG GAACGTGAAG GGCCACGTGG ACGTGGAGGA GCAGAACAAC 
CACCGCAGCG TGCTGGTGGT GCCCGAGTGG GAGGCCGAGG TGAGCCAGGA GGTGCGCGTG 
TGCCCCGGCC GCGGCTACAT CCTGCGCGTG ACCGCCTACA AGGAGGGCTA CGGCGAGGGC 
TGCGTGACCA TCCACGAGAT CGAGAACAAC ACCGACGAGC TCAAGTTCAG CAACTGCGTG 
GAGGAGGAGG TGTACCCCAA CAACACCGTG ACCTGCAACG ACTACACCGC CACCCAGGAG 
GAGTACGAGG GCACCTACAC CAGCCGCAAC CGCGGCTACG ACGGCGCCTA CGAGAGCAAC 
AGCAGCGTGC CCGCCGACTA CGCCAGCGCC TACGAGGAGA AGGCCTACAC CGACGGCCGC 
CGCGACAACC CCTGCGAGAG CAACCGCGGC TACGGCGACT ACACCCCCCT GCCCGCCGGC 
TACGTGACCA AGGAGCTGGA GTACTTCCCC GAGACCGACA AGGTGTGGAT CGAGATCGGC 
GAGACCGAGG GCACCTTCAT CGTGGACAGC GTGGAGCTGC TGCTGATGGA GGAGTAGTAC 
ATGTGATAGT ACGTAAGCTC GAGGATCT 
(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
, (D) TOPOLOGY: linear 

m 

(ii) MOLECULE TYPE: DNA 
« (iv) ANTI-SENSE: NO 

<Vi> °«u I S£JSS? i Truncated synthetic BT gene (Perlak et al . ) 
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(xi , SEQUENCE DESCRIPTION: SEQ ID NO : 5 : taaCCCAGAA 
L ACCCAAACAT CAACGAATGC ATTCCATACA ACTGCTTGAG TAACCCAGAA 
AT GGACAACA ACCCAAACAT arrGGTTACA CTCCCATCGA CATCTCCTTG 

GTTGAAGTAC TTGGTGGAGA ACGCATTGAA ACCGGTTACA TrTCGGAC TA 
GTTGAAG CAGCGAGTTC GTGCCAGGTG CTGGGTTCGT TCTCGGACTA 

TCCTTGACAC AGTTTCTGCT ^ an . rrATTC CT GGTGCAAATT 

_ ^ rrcGTA T CTTTGGTCCA TCTCAATGGG ATGCATTCCT C 
GTTGACATCA TCTGGGGTAT CTii « m rrnC*X CTCTAGGTTG 

~nr*r GAT CGAAGAG TTCGCCAGGA ACCAGGCCAT CH-i 
rirnAGTTGA T CAACCAGAG GATCGAA^-^ ^^» rrrr RT 
GAGCAt3i ^,.,., rrT TCAGAGAGTG GGAAGCCGAT 

„, rl -p. rtCTCT A CCAAAT CT AT GCAGAGAGCT TCAUKU 

gaaggattga gcaatctcta CGTATTC aat tcaacgacat gaacagcgcc 

CCTACTAACC CAGCTCTCCG CGAGGAAAT G CGTATTCAAT ^rrCCGTG 
CCTACTAA GTTCGCAGTC CAGAACTACC AAGTTCCTCT CTTGTCCGTG 

TTGACCACAG CTATCCCATT ton r , ri>Tirr _. GTTTG GGCAA 

. nvrCTAATCT TCACCTCAGC GTGCTTCGAG ACGTTAGCGT 
TACGTTCAAG CAGCTAATCT TCA arracC TTAC TAGGCTGATT 

. BrrlTr iiT AGCCGTTACA ACGACCTTAU 
AGGTGGGGAT TCGATGCTGC AACCATCAAT AGC „, rrarcG TGTCTGGGGT 

r. n nrr TGTTCGTTGG TACAACACTG GCTTGGAGCG TGTCTG 
GGAAACTACA CCGACCACGC ™" GAGAATTGAC CCTCACAGTT 

CCTGATTCTA GAGATTGGAT TAGATACAAC CAGTTCAGGA ccGTACAGTG 

^ TPTCTCTCTT CCCGAACTAT GACTCCAGAA CCTACCCTAT CCG 
TTGGACATTG TGTCTCTCTT rCAGTTCTTG AGAACTTCGA CGGTAGCTTC 

XCCCAACTTA CCAGAGAAAT CTATACTAAC CCAGTTCTTG QGACATCTTG 

rrraarGTAT CGAAGGCTCC AT CAGGAGCC CACACTTGAT ^ 
CGTGGTTCTG CCCAAGGTAT CGAA ATTACTGG TC TGGACACCAG 

PTATCTACAG CGATGCTCAC AGAGGAGAGT ATTAU i 
AACAGCATAA CTATCTACAia rcTTTCCTCT CTATGGAACT 

^rr a rTTGG ATTCAGCGGG CCCGAGTTTA CCTTTCCTUi 
ATCATGGCCT CTCCAGTTGG ATTt-A „ arrTC AGGG TGTCTACAGA 

^ rCGCTCCACA ACAACGT AT C GTTGCTCAAC TAGGTCAGGG T 
ATGGGAAACG CCGCTCCACA tTCAATATCG GTATCAACAA CCAGCAACTT 

accttgtctt ccaccttgta cagaagaccc ttcaatx atccgctgtt 

TCCGTTCTTG ACGGAACAGA GTTCGCCTAT GGAACCTCTT C CAACAATGTG 

rrrrllir , GT TGATTCCTTG GACGAAATCC CACCACAGAA CAA^ 
TACAGAAAGA GCGGAACCGT TGAT CCATGTTCCG TTCCGGATTC 

„r> **rrATTCTC CCACAGGTTG AGCCACGTGT CCAio 
CCACCCAGGC AAGGATTCTC cCTATGTTCT CATGGATTCA TCGTAGTGCT 

AGCAACAGTT CCGTGAGCAT CATCAGAGCT TCCCATTGAC CAAGTCTACT 

-.^•^^n.r-r- TTCCTCTCAA ATCACCCAAA TCCCAU^ 
QAGTTCAACA ATATC&TTCC TICCTC XCACAGGAGG TGATRTTCTT 

ftACCTTGGRT CTGGAACTTC TGTCGTGAAA 
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AGAAGAACTT CTCCTGGCCA GATTAGCACC 
CAAAGATATC GTGTCAGGAT TCGTTACGCA 
ATCGACGGAA GGCCTATCAA TCAGGGTAAC 
TTGCAATCCG GCAGCTTCAG AACCGTCGGT 
TCAAGCGTTT TCACCCTTAG CGCTCATGTG 
CGTATTGAGT TTGTGCCTGC CGAAGTTACC 



CTCAGAGTTA AC AT C ACT GC ACCACTTTCT 
TCTACCACTA ACTTGCAATT CCACACCTCC 
TTCTCCGCAA CCATGTCAAG CGGCAGCAAC 
TTCACTACTC CTTTCAACTT CTCTAACGGA 
TTCAATTCTG GCAATGAAGT GTACATTGAC 
TTCGAGGCTG AGTAC 
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A. The indications 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bis) 

made below relate ,o the microorganism referred to in <be description 



on page 



159 



, lines 



21-25 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q 



Name of depositary institution 

Agricultural Research Culture Collection (NRRI*) 



Address of depositary institution (tnclu&nz postcode and country) 

1815 Korth University Street 
Peoria, 3X 61604 



Date of deposit 

4 October 1991 (04.10-91) 



Accession Number 
HRKL B 18891 



.n^o^I.INDICATIONS^^^^^ mis infonn.tion is conUnueo on .n addition,, sheet Q 



We revest the Expert Solution 



rrnnrr -ry~— — ™ ^.cations aremade^^^^^^ 



E. SEPARATE FURNISHING OF INDICATIONS VP*c*» 

Number of Deposit') 



For receiving Office use only 



Q This sheet was received with the international application 




For International Bureau use only 



□ Tois sheet was received by the International Bureau on: 




Form PCI7RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule I3bis) 



A. Hie indications made below relate to the microorganism referred to in the description 

159 , lines 21-25 



on page 




B. IDENTIFICATION OF DEPOSIT 



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 



Address of depositary institution (including postal code and country) 

1815 North University Street 

Peoria r XL 61604 

U.S.A. 



C. ADDITIONAL INDICATIONS (leave blank if not appUcable) This information is continued on an additional sheet Q) 



Date of deposit 

4 Octoher 1991 (04.10-91) 



Accession Number 
NRRL B 18892 



We request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed belowwill be submitted to the International Bureau later (specify the general nature of the indications^ 'Accession 
Number of Deposit*) 



For receiving Office use only 



Tnis sheet was received with the international application 



Authorized officer 



For International Bureau use only 



^2 Tbi* sheet was received by the International Bureau 



on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Ru\cl3bis) 



A* The indications 
on page 159 



made below relate to Z microorganism referred to in the description 

, lines 21-25 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet [j 



Name of depositary institution 

*pr iOT ltural Research Culture Collection (HRRL) 



Address of depository institution (including postal code end country) 

1815 North University Street 

Peoria, XL 61604 

U.S.A. 



Date of deposit 

3 September 1992 (03.09.92) 



Accession Number 
NRRL B 18994 



t , tv v This information is continued on an additional sheet Q 
ADDITIONAL INDICATIONS (leave blank if «ot applicable) Itits iniorma , 



reediest the Expert Solution 



D. DESIGNATED STATES FOR WHICH 



INDICATIONS ARE MADE Of the truncations are not Jar all designated States) 



E g FTP AT* ATE FURNISHING OF INDICATION S (leave blank if not applicable) 

Number of Deposit") 



'Accession 



For receiving Office use only 



Q This sheet was received with the intemaUonal application 



Authorized officer 



For International Bureau use only 



Q This sheet was received by the International Bureau on: 



Authorized officer 




Form PCI7RO/134 (July 1992) 



176 



WO 93/07278 



PCT/US92/08476 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 



A. The indications made beiow relate to the microorganism referred to in the description 
on page 159 , lines 21-25 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet | | 

Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 

Address of depositary institution (including postal code and country) 

1B15 North University Street 

Peoria, XL 61604 

U.S.A. 



Date of deposit 




Accession Number 


3 Septembe 


r 1992 (03.09.92) 


NRRL B 18995 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) Th is information is continued on an additional sheet Q 



We request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE Of the indications are not for all designated Stales) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau la ter (specify the general nature of the indications eg., 'Accession 
Number of Deposit") 



For receiving Office use only 



| | This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



f^] This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule I3bis) 



A- The indications 



made below relate to .he mtaoorganisrn referred «o in .be description 



on pag e 



159 



lines 



21-25 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet [J 



Name of depositary institution 

AgricuXturaX Research Culture Collectio n (KRRI^ 

Address of depositary institution Occluding postal code and country) 

1815 North university Street 
Peoria,, H. 61604 
TJ-S.A-. 



Date of deposit 

4 September 1992 (04.09.92) 



Accession Number 
HRRI* B 18996 



immT — T ^^ON S U^^f^ap^U) This inf onnMion is conUnued on ,n 



additional sheet | | 



We 



request the Expert Solution 



D. DESIGNATED STATES FOR WHICH 



INDICATIONS ARE MADE (if the Indications are n<* for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS 

Number of Deposit") 



For receiving Office use only 



□ This sheet was received with the international application 




For International Bureau use only 



Q ifcis sheet was received by the International Bureau on: 




Form PCT/RO/I34 (July 1992) 



178 



WO 93/07278 



PCT/US92/08476 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule I3bis) 




Date of deposit 

21 September 1992 (21.09.92) 


Accession Number 
NRRL B 18997 


C ADDITIONAL INDICATIONS (leave blank if net applicable) This information is continued on an additional sheet □ 



A. The indications made below relate to the microorganism referred to in the description 
on page 159 , lines 21-25 



B. IDENTIFICATION OF DEPOSIT 



Name of depositary institution 

Agricultural Research Culture collection (NRRL) 



Address of depositary institution (including postal code and country) 

1815 North University Street 
Peoria, TL 61604 
U.S.A. 



we request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated Stoics) 



E- SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

Tne indications listed below will be submitted to the International Bureau later (specify the general nature cf Vindications eg., 'Accession' 
Number of Deposit ") 



For receiving Office use only 



Q This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



P^j This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuIc I3bis) 





A* The indications 



on page 



made belo«r relate to the microorganism referred to in the description 



■a 



, lines 



21-25 



B . IDENTIFICATION OF DEPOSIT 



Name of depositary institution 

Agricnltural Research Cuiture 



Further deposits are identified on an additional sheet 

□ 



(KRRI») 



Address of depository institution fincludixg postal code and country) 

1815 north University Street 
Peoria, TE. 61604 
U.S.A. 



Date of deposit 

21 September 1992 (2X.09.92) 



Accession Number 
NRRL B 18998 



UTimnTTU nmT-nT— f^ — . This mfonnauo, is continued on an additional .beet □ 



We request the Expert 



; -pi — TprT ITT" — ~™ ™™ INDICATIONS ARE MADE 



SEPARATE FURNISHING OF INDICATIONS (teW^.f^) 



The indications listed below will be submitted to the 
Number of Deposit') 



International Bureau U^spcafyihcg^l^r.ofih.UuU^orse.g., 'Acc^on 



J- 



For receiving Office use only 



Q Ibis sheet was received with the international application 




- For International Bureau use only 



□ This sheet was received by the International Bureau on: 




Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc I3bis) 



B- IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet 


□_ 


Name of depositary institution 






Agricultural Research Culture Collection (NRRI*) 




Address of depositary institution (including postal code and country) 






1815 North University Street 






Peoria, II* 61604 






U.S.A. 






Date of deposit 


Accession Number 




21 September 1992 (21-09-92) 


KRRL B 18999 




C ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 


□ 



A. The indications made below relate to the microorganism referred to in the description 

159 , lines 21-25 



on page 



We request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ftf the indications are not far alt designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



ToemdicatioiislistedbelowwiHbesu^ 'Accession 
Number of Deposit") 



For receiving Office use only 



[*U This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



f^] This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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What: is claimed is: optimized 

aeotide sequence comprising a maize p 



1. A nuc 



cod in g SW «ce -Mo* a»eodas a pro.ain =apa>l. .« —in, an 

insect. ^ 

2. A nucleotide sequence of claim 1, wherein said 

coding sequence encodes a BJ^ protein. 

3. A nucleotide sequence of claim 2, wherein said 
coding sequence encodes a CrylA(b) protein. 

4 a nucleotide sequence of claim 3, wherein said 
coding sequence comprises Sequence 3, Sequence 4, or the 
sequence set forth in Fig. 7. 

nf c iaim 3, wherein coding 

5. A nucleotide sequence of claim j, 

^ B * CrvIA(b) protein which is heat stable 
sequence encodes a CryiA^j pi 

compared to a native CrylA(b) protein. 

of r c iaim 5, wherein said 

6. A nucleotide sequence of claim o, 

««^o epiected from the group 
coding sequence comprises a sequence selecte 

* v< a 9 Fig. 11. Fig. 13 and Fig. 13- 
of sequences consisting of Fig. 9, Fig. 

7 . a nucleotide sequence of claim 2, wherein said 

* PrvIB or a CrylA(c) protein, 
coding sequence encodes a Cry IB or ... 

8 A nucleotide sequence of claim 7, wherein said 

i the sequence encoding a Cry IB protein 
coding sequence comprises the sequen 

set forth in Fig. 6. , 

, * nuclaotida sa^anca of =lai» 1. -«har co^r-n, 
. £irst p»-~r capafcla o £ dir.e t i», a-prassion of a nuclaotida 
5eqoe ,c« in a plan, call, oparaMy li»*ed Co said codin, 



sequence . 

10 



A nucleotide sequence of claim 9, wherein said 
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promoter is capable of directing expression of the associated 
coding sequence in a maize cell. 

11. A nucleotide sequence of claim 9, wherein said 
promoter is selected from the group consisting of inducible 
promoters r constitutive promoters, temporal or 
developmentally-regulated promoters, tissue-preferred, and 

tissue-specific promoters . 

12. A nucleotide sequence of claim 9, wherein said 
promoter is selected from the group consisting of a CaMV 35S 
promoter, CaMV 19S promoter, a PEP carboxylase promoter, a 
pith-preferred promoter, and a pollen-specific promoter. 

13. A nucleotide sequence of claim 9, wherein said 
promoter is a pith-preferred promoter comprising the DNA 
sequence set forth in Fig. 24. 

14. A nucleotide sequence of claim 9, wherein saxd 
promoter is a pollen-specific promoter comprising the DNA 

sequence set forth in Fig. 35. 

15. A nucleotide sequence of claim 9, further 
comprising a second promoter capable of directing expression of 
an associated coding sequence in a plant cell, operatively 
linked to a second coding sequence. 

17. A nucleotide sequence of claim 15, wherein said 
second promoter is selected from the group consisting of 
inducible promoters, constitutive promoters, temporal or 
developmentally-regulated promoters, tissue-preferred, and 

tissue-specific promoters. 

18. A nucleotide sequence of claim 15, wherein said 
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. • . elected from the group consisting of a CaMV 
second promoter is selected tro 

353 promoter, CaM, .SB promoter, a PE P carboxylase promoter, 
pith-preferred promoter, and a pollen-specific promoter. 

21 A nucleotide sequence of claim 15, wherein saxd 
second coding sequence is a plant optimized coding sequence 
which encodes a protein capable of killing an insect. 

22. A nucleotide sequence of claim 21, wherein said 
second coding sequence encodes a protein. 

23. A nucleotide sequence of claim 22, wherein said 
second coding sequence encodes a CrylA(b) protein. 

29. A nucleotide sequence of claim 15, wherein said 

second coding sequence is a marker gene . 

30. A recombinant vector, comprising a nucleotide 

sequence of claim 9. 

31 A recombinant vector, comprising a nucleotide 

sequence of ciaim 9, wherein said coding sequence encodes a »1 
protein. 

32. A recombinant vector of claim 31, wherein the B^t_ 

protein is a CrylA(b) protein. 

33. A recombinant vector, comprising a nucleotxde 

sequence of claim 15 - 

34. A recombinant vector, comprising a nucleotide 

sequence of claim 18. 

35. A recombinant vector, comprising a nucleotide 

sequence of claim 22. 

36. A plant stably transformed with a nucleotide 

sequence of claim 9. 

184 



WO 93/07278 



PCT/US92/08476 



37. A plant stably transformed with a nucleotide 

sequence of claim 12. 

38. A plant of claim 36, wherein said coding sequence 

encodes a B.t . protein. 

39. A plant of claim 36, wherein said protein is 
expressed in said plant in an amount sufficient to control 
Lepidopteran or Coleopteran insects. 

40. A plant of claim 38, wherein the B.t. protein is a 

CrylA(b) protein. 

41. A plant of claim 38, which expresses the B.t. 
insecticidal protein in an amount sufficient to control 
Lepidopteran or Coleopteran pests. 

42. A plant of claim 38, wherein the amount is 
sufficient to control insects selected from the group consisting 
of European corn borer, Sugarcane borer, stalk borers, cutworms, 
armyworms, rootworms, wireworms and aphids. 

43. A plant stably transformed with a nucleotide 

sequence of claim 15 . 

44. A plant stably transformed with a nucleotide 

sequence of claim 18. 

45. A plant of claim 43, wherein the first and second 

coding sequences each encode a BJ^ protein. 

46. A plant of claim 38, which expresses the B^t. 
insecticidal proteins in an amount sufficient to control 
Lepidopteran or Coleopteran pests. 

47. A plant of claim 46, wherein the amount is 

sufficient to control stalk borers. 
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18 An isolated and purified promoter capable of 
Erecting pith-preferred expression of an associated structural 
gene in a plant. 

, ,> ffl 4 o isolated from a monocot . 

49. A promoter of claim 4B r isoi^c 

aft 4 qniated from a maize 

50. A promoter of claxm 48, isoiacea 

plant - 

51. A promoter of claim 48, isolated from a plant 
tryptophan synthase-alpha (TrpA) subunit gene. 

52. A promoter of claim 51, isolated from a maize 
tryptophan synthase-alpha (TrpA) subunit gene . 

53 . A promoter of claim 48, comprising the sequence set 

forth in Figure 24. 

5 4 a recombinant DNA molecule comprising a promoter 

clais, «.. operabl, associated with a structural gene encoding a 

protein of interest. . 

55 A recombinant DNA molecule of claim 54, -herein 

saiu structural gene encodes an insecticidal protein . 

s6 a recombinant DNA molecule of claim 55. wherein 
said structural gene encodes a Bacillus. P"" 1 "" 

„ . A vector, comprising a recombinant DNA molecule of 

Clai " 54 ' , . lsi „ 57 wherein said structural gene 

58. A vector of claim 57, wneien 

encodes an insecticidal protein. 

, • *n wherein said structural gene 

59. A vector of claim 57, wherein 

a Bacillus t-hiiT-inaiensis protein." 

encodes a Bacinu» a. „«„*. nwa 

M A vector, comprising at least two recombinant 

nol ecules of claim 54, wherein at least one of the two 
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structural genes encodes an insecticidal protein. 

61. A plant stably transformed with recombinant DNA 
molecule of claim 54. 

62. A plant of claim 61, which is a maize plant. 

63. A purified promoter capable of directing 
pollen-specific expression of an associated structural gene in a 
plant , wherein said promoter is isolated from a plant 
calcium-dependent phosphate kinase (CDPK) gene. 

64. A promoter of claim 63, isolated from a monocot CDPK 

gene . 

65. A promoter of claim 63 , isolated from a maize CDPK 

gene . 

66. A promoter of claim 65, comprising the sequence set 
forth in Figure 35 . 

67. A recombinant DNA molecule, comprising a promoter 
of claim 63, operably associated with a structural gene encoding 
a protein of interest . 

68. A recombinant DNA molecule, of claim 67, wherein 
said structural gene encodes an ' insecticidal protein. 

69. A recombinant DNA molecule of claim 68, wherein 
said structural gene encodes a Bacillus thuringiensis protein. 

70. A vector, comprising at least one recombinant DNA 

molecule of claim 67. 

71. A vector of claim 70, wherein said structural gene 

encodes an insecticidal protein. 

72. A vector of claim 71, wherein said structural gene 

encodes a Bacillus thuringiensis protein. 
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73. A vector of claim 70, comprising two recombinant 

4- one of the two structural genes 

DNA molecules, wherein at least one or 

encodes an insecticidal protein. 

74. A plant stably transformed with a recombinant DNA 

molecule of claim 67 . 

75 A plant of claim 74, which is maize plant. 

76. A maize plant stably transformed with at least one 

recombinant DNA molecule; 

wherein said DNA molecule comprises a promoter 

operably linked to a nucleotide sequence encoding an 

insecticidal protein. 

wherein said promoter is capable of directing 
tissue-preferred or tissue-specific expression of said gene in 

said maize plant. 

77. A maize plant of claim 76, wherein said gene 

encodes a Bacillus thnrlnaienais protein . 

78. A plant of claim 76, wherein said promoter is 

obtained from a monocot . 

79. A plant of claim 78, wherein said monocot is 

maize. 

60. A maize plant of claim 76. wherein said promoter 
is capable of directing pith-pref erred expression of said gen. 

in said maize plant. 

81. A plant of claim 76, wherein said promoter is 
obtained from a plant tryptophan synthase- alpha subunit gene. 

82. A plant of claim 76, wherein said promoter is 
obtained from a maize tryptophan synthase-alpha subunit gene. 
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83. A plant of claim 82, wherein said promoter 
comprises the sequence set forth in Figure 24. 

84. A plant of claim 76, wherein said promoter is 
capable of directing green tissue-specific expression of said 
gene in said maize plant. 

85. A plant of claim 84, wherein said promoter is a 
PEP carboxylase promoter. 

86. A maize plant of claim 76, wherein said promoter 
is capable of directing pollen-specific expression of said gene 

in said maize plant. 

87. A maize plant of claim 76, wherein said promoter 
is obtained from a plant calcium-dependent phosphate kinase 
gene . 

88. A maize plant of claim 76, wherein said promoter 
is obtained from a maize calcium-dependent phosphate kinase 
gene . 

89. A maize plant of claim 76, wherein said promoter 
comprises the sequence set forth in Figure 35. 

90. A method of producing a maize optimized coding 
sequence for an insecticidal B.t. protein, comprising: 

determining the amino acid sequence of a predetermined 
insecticidal B.t. protein, and 

altering the coding sequence of the protein by 
substituting codons which are most preferred in maize for 
corresponding native codons. 

91. A method of protecting a maize plant against at 

least one insect pest, comprising: 
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i~r* * maize plant with at least one 
stably transforming a maize P^ 1 

nuclide sequence of claim 9, wherein the coding seance 
encodes an insecticidal protein; whereby the transformed maize 

. • 4nsectic idal protein in an amount sufficxent 
plant expresses the msecticiaai 

to protect the plant against the pest. 

92. The method of claim 91, wherein said insecticidal 

protein is a B.t. protein. 

93. The method of claim 91, wherein said promoter is a 

tissue-specific or a tissue-preferred promoter. 

94 . A plant which has been stably transformed with a 
nucleotide sequence wherein said nucleotide sequence comprises a 
ma ize optimized coding sequence for an insecticidal protein, 
wherein said protein is expressed in said transformed plant at 
least 100 fold greater than egression of the protein using a 
native coding sequence. 
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64 ATGGACCTGC TGCCCGACGC CCGCATCGAG C^CAGCCTGT ^TC^CGA GGGCAACAAC 
MeLAspLeu LeuProAsp AlaArglleGlu AspSerLeu CysIleAla GluGlyAsnAsn 

124 ATCGACCCCT TCGTGAGCGC CAGCACCGTG CAGACCGGCA TCj^TOGC MGCCGCATC 
IleAspPro PheValSer AlaSerTtirVal GlnThrGly IleAsnlle AlaGlyArglle 

184 CTGGGCGTGC TGGGCGTGCC CTTCGCCGGC ^^^^^"^SeSKSSSS 
LeuGlyVal LeuGlyVal ProPheAlaGly GlnLeuAla SerPheTyr SerPDeLeuvai 

24 4 GGCGAGCTGT GGCCCCGCGG CCGCGACCAG TGGGAGATCT TCCTGGAGCA CGTGGAGCM 
GlyGluLeu TrpProArg GlyArgAspGln TrpGluIle PheLeuGlu HisValGluGln 

30 4 CTGATCAACC AGCAGATCAC CGAGAACGCC CGCAACACCG CCCTGGCCCG CCTGCAGGGC 
LeuIleAsn GlnGlnlle ThrGluAsnAla ArgAsnThir AlaLeuAla ArgLeuGlnGly 

364 CTGGGCGACA GCTTCCGCGC CTACCAGCAG AGCCTGGAGG ACTGGCTGGA GAACCGCGAC 
LeuGlyAsp SerPheArg AlaTyrGlnGln SerLeuGlu AspTrpLeu GluAsnArgAsp 

424 GACGCCCGCA CCCGCAGCGT GCTGTACACC CAGTACATCG CCCTGGAGCT GGACTTCCTG 
AspAlaArg ThrArgSer ValLeuTyrThr GlnTyrlle AlaLeuGlu LeuAspPheLeu 

484 AACGCCATGC CCCTGTTCGC CATCCGCAAC CAGGAGGTGC CCCTGCTGAT GGTGTACGCC 
AsnAlaMet ProLeuPhe AlalleArgAsn GlnGluVal ProLeuLeu MetValTyrAla 

544 CAGGCCGCCA ACCTGCACCT GCTGCTGCTG CGCGACGCCA GCCTGTTCC^ CAGCGAGTTC 
GlnAJLaAla AsnLeuHis LeuLeuLeuLeu ArgAspAla SerLeuPhe GlySerGluPhe 

604 GGCCTGACCA GCCAGGAGAT CCAGCGCTAC TACGAGCGCC AGGTG^GCG^CACCCGCGAC 
GlyLeuThr SerGlnGlu IleGlnArgTyr TyrGluArg GlnValGlu ArgTbrArgAsp 

664 TACAGCGACT ACTGCGTGGA GTGGTACAAC ACCG^CTGA AC^GCCTGCG CG^CCAAC 
TyrSerAsp TyrCysVal GluTrpTyrAsn ThrGlyLeu AsnSerLeu ArgGlyTnrAsn 

724 GCCGCCAGCT GGGTGCGCTA CAACCAGTTC CGCCGCGACC TGACCCTGGG CGTGCTGGAC 
AlaAlaSer TrpValArg TyrAsnGlnPhe ArgArgAsp LeuThrLeu GlyValLeuAsp 

784 CTGGTGGCCC TGTTCCCCAG CTACGACACC CGCACCTACC CCATC^CAC^GCGCCCAG 
LeuValAla LeuPhePro SerTyrAspThr ArgThrTyr ProIleAsn ThrSerAlaGl* 
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1084 GGCCTGAACA CCAGCACCCA CGGCGCCACC AACACCAGCA TCAACCCCGT GACCCTGCGC 
GlyLeuAsn ThrSerThr HisGlyAlaThr AsnThrSer IleAsnPro ValThrLeuArg 

1144 TTCGCCAGCC GCGACGTGTA CCGCACCGAG AGCTACGCCG GCGTGCTGCT GTGGGGCATC 
PheAlaSer ArgAspVal TyrArgThrGlu SerTyrAla GlyValLeu LeuTrpGlylle 

1204 TACCTGGAGC CCATCCACGG CGTGCCCACC GTGCGCTTCA ACTTCACCAA CCCCCAGAAC 
TyrLeuGlu ProIleHis GlyValProThr ValArgPhe AsnPheThr AsnProGlnAsn 

1264 ATCAGCGACC GCGGCACCGC CAACTACAGC CAGCCCTACG AGAGCCCCGG CCTGCAGCTG 
IleSerAsp ArgGlyThr AlaAsnTyrSer GlnProTyr GluSerPro GlyLeuGlnLeu 

1324 AAGGACAGCG AGACCGAGCT GCCCCCCGAG ACCACCGAGC GCCCCAACTA CGAGAGCTAC 
LysAspSer GluThrGlu LeuProProGlu ThrThrGlu ArgProAsn TyrGluSerTyr 

1384 AGCCACCGCC TGAGCCACAT CGGCATCATC CTGCAGAGCC GCGTGAACGT ^CGTGTAC 
SerHisArg LeuSerHis IleGlyllelle LeuGlnSer AxgValAsn ValProValTyr 

4 44 AGCTGGACCC ACCGCAGCGC CGACCGCACC AACACCATCG GCCCCAACCG CATCACCCAG 
SerTrpThr HisArgSer AlaAspAxgThr AsnThrlle GlyProAsn ArgIleTh_rGLn 

1504 ATCCCCATGG TGAAGGCCAG CGAGCTGCCC CAGGGCACCA CCGTGGTGCG CGGCCCCGGC 
IleProMet ValLysAla SerGluLeuPro GlnGlyTtir ThrValVal ArgGlyProGly 

1564 TTCACCGGCG GCGACATCCT GCGCCGCACC AACACCGGCG GCTTCGGCCC CATCCGCGTG 
PheThrGly GlyAspIle LeuArgArgThr AsnThrGly GlyPheGly ProIleArgVai 

1624 ACCGTGAACG GCCCCCTGAC CCAGCGCTAC C ^^ C ^ CT p ^?^I^ G A S2SSSIl 
ThrValAsn GlyProLeu ThrGlnArgTyr ArglleGly PheArgTyr AlaSerThxVal 

1 RR4 OAPTTCGACT TCTTCGTGAG CCGCGGCGGC ACCACCGTGA ACAACTTCCG CTTCCTGCGC 
168 SpSS^p PhSJeSal SerArgGlyGly ThrThrVal AsnAsnPhe ArgPheLeuArg 

, nAA , rrlTr , X rA GCGGCGACGA GCTGAAGTAC GGCAACTTCG TGCGCCGCGC CTTCACCACC 
^Se^S^SSp^luIeuLysTyr GlyAsnPhe ValAxgArg AlaPheThrThr 

, on . rrrTT nrrT TCACCCAGAT CCAGGACATC ATCCGCACCA GCATCCAGGG CCTGAGCGGC 
1804 plSSSt^l^nSpIle IleAxgThx SerlleGln GlyLeuSerGly 

1864 AACGGCGAGG TGTACATCGA CAAGATCGAG ATCATCCCCG ^g^^^TTCGAGGCC 
AsnGlyGlu ValTyrlle AspLysIleGlu IlellePro ValTHrAla THrPHeGluAla 

■to? a mrTircarr TGGAGCGCGC CCAGGAGGCC GTGAACGCCC TGTTCACCAA CACCAACCCC 
19 SSJSp lISSS^^IaSnGluAla ValAsnAla LeuPheThr AsnThrAsnPro 

1QA4 rRrr.CCCTGA AGACCGACGT gaccgactac cacatcgacc aggtgagcaa CCTGGTGGCC 
198 Sg^2S\yS£Sp ValThrAspTyr HisIleAsp GlnValSer AsnLeuValAla 

onAA wrTr , Anr r ACGAGTTCTG CCTGGACGAG AAGCGCGAGC TGCTGGAGAA GCTGAAGTAC 
2044 cSSSS aJpSSS CysLeuAspGlu LysArgGlu LeuLeuGlu LysValLysTyr 
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OCC^GCGCC TGAGCGACGA GCGCAACCTG £™£C ™T« ? C^TCJJC 
AlaLysArg LreuSerAsp GluArgAsnLeu LeuGinAsp rru^x 

AAGCAGCCCG ACTTCATCAG CACCAACGAG CAGAGCAACT TCACC^T CCACGAGCAG 
LysGlnPro AspPhelle SerThrAsnGlu GlnSerAsn PheThrSer IleHisGluGln 

IOT1 r^Arr GCTGGTGGGG CAGCGAGAAC atcaccatcc aggagggcaa cgacgtgttc 
sSSSSs gS^^I^SuA^h IleThrlle GlnGluGly AsnAspValPhe 

AAGGAGAACT ACGTGACCCT GCCCGGCACC TTCAACGAGT ^TACCCCAC CTACCTGTAC 
LysGluAsn TyrValThr LeuProGlyThr PheAsnGlu CysTyrPro ThrlyrLeuTyr 

CAGAAGATCG GCGAGAGCGA GCTGAAGGCC TACACCCGCT ACCAGCTGCG gGCTACATC 
GlnLysIle GlyGluSer GluLeuLysAla TyrThrArg TyrGlnLeu ArgGlyTyrlle 

2404 GAGGACAGCC AGGACCTGGA GATCTACCTG ATCCGCTACA ACGCCAAGCA CGAGACCCTG 
GluAspSer GluAspLeu GluIleTyrLeu IleAxgTyr AsnAlaLys HisGluThrLeu 

24 64 GACGTGCCCG GCACCGAGAG CCTGTGGCCC CTGAGCGTGG AGAGCCCCAT CGQCCGCTGC 
AspValPro GlyThrGlu SerLeuTrpPro LeuSerVal GluSerPro IleGlyArguys 

GGCGAGCCCA ACCGCTGCGC CCCCCACTTC GAGTGGAACC CCGACCTGGA CTGCAGCTGC 
" 24 gSSS^SSgCyl^laProHisPhe GluTrpAsn ProAapLeu AspCysSerCys 

=f ^S!Sg?5S55S5S SS»SS 
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PAfTRCACCG CCCACCAGGG CACCGCCGGC TGCGCCGACG CCTGCAACAG CCGCAACGCC 
25SSS^*S3SSS^lyThrAl«Gly CysAlaAsp AlaCysAsn SerArgAsnAla 

GGCTACGAGG ACGCCTACGA GGTGGACACC JCCGCCAGCG v ™5gfJ y 3S3£3£ 
GlyTyrGlu AspAlaTyr GluValAspThr ThrAlaSer ValAsnTyr ^roiar y 

GAGGAGGAGA CCTACACCGA CGTGCGCCGC C^CAACCACT GCGAGTACC^CCG^TAC 
GluGluGlu ThrTyrTUr AspValArgArg AspAsnHis CysGluTyr AspArgGlyTyr 



3664 GAGCTGCTGC TGATGGAGGA GTAG 
GluLeuLeu LeuMetGlu Glu 
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184 CTGGGCGTGC TGGGCGTGCC CTTCGCCGGC "GCTGOCC* JSJSS^I.SSSISS! 
LeuGlyVal LeuGlyVal ProPheAlaGly GlnLeuAla Mtmeiyi 

CAGGCCGCCA ACCTGCACCT GCTGCTGCTG CGCGACGCCA GCCTGTTCGG CAGCGAGTTC 
SSSSf AinSSSs LeuLeuLeuLeu ArgAspAla SerLeuPhe GlySerGluPbe 

CGCCTGACCA GCCAGGAGAT CCJOCGCgC ^^ggSSSTZg^SgZ 

CTGACCCGCG AGGTGT ACAC CGACGCCATC g»g£f .^SS^SSSSS 
LeuTHrAxg GluValTyr ThrAspAlalle GlyAaaTnr 

„„„ ^pr.rr-rTr AGCGCCATCG AGGCCGCCGC CATCCGCAGC 

rr*r-rKrjTTC ACCATCTTCA GCGCCAGCAG CCGCTGGAGC 

rTr^rrxr^C CACACCATCC AGAGCCGCCC CATCGGCGGC 
102< JSK^Effi^SSSS SSSrll. P.oHeGiyGXy 
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^^rpp.rr PTOrOCTTCA ACTTCACCAA CCCCCAGAAC 

AACGGCGAGG TGTACATCGA CAAGATCGAG ™~«*^S?55SSS£ 
AsnGlyGlu ValTyrlle GTGVICGCCC TGTTCACCAA CACCAACCCC 

GAGTACGACC TGGBGCGCGC CCAGGSGGCC ?™MCWAA. AsnThrAsnPro 
GluTyrAsp LeuGluArg AlaGlnGluAla vaiAsiuu.a 

„™ «^.rr>f CACATCGACC AGGTGAGCAA CCTGGTGGCC 
CGCCGCCTGA AGACCGACGT GACCGACTAC jg"*^*; GlnValSer AsnLcuValAla 
ArgArgl^u LysThrAsp valThrAspTyr Hislieusp 
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2104 GCCAAGCGCC TGAGCGACGA GCGCAACCTG CTGCAGGACC CCAACTTCAC CAGCATCAAC 
AlaLysAxg LeuSerAsp GluAxgAsnLeu LeuGlnAsp ProAsnPhe ThrSerlleAsn 

2164 AAGCAGCCCG ACTTCATCAG CACCAACGAG CAGAGCAACT TCACCAGCAT CCACGAGCAG 
LysGlnPro AspPhellc SerThrAsnGlu GlnSerAsn PheThrSer HeHisGluGln 

2224 AGCGAGCACG GCTGGTGGGG CAGCGAGAAC ATCACCATCC AGGAGGGCAA CGACGTGTTC 
SerGluHis GlyTrpTrp GlySerGluAsn IleThxIle GlnGluGly AsnAspValPhe 

2284 AAGGAGAACT ACGTGACCCT GCCCGGCACC TTCAACGAGT GCTACCCCAC CTACCTGTAC 
LysGluAsn TyrValThr I^uProGlyThr PheAsnGlu CysTyrPro ThrTyrLeuTyr 

2 34 4 CAGAAGATCG GCGAGAGCGA GCTGAAGGCC TACACCCGCT ACCAGCTGCG CGGCTACATC 
GlnLysIle GlyGluSer GluLeuLysAla TyrThrArg TyrGlnLeu AxgGlyTyrlle 

24 04 GAGGACAGCC AGGACCTGGA GATCTACCTG ATCCGCTACA ACGCCAAGCA CGAGACCCTG 
GluAspSer GlnAspLeu GluIleTyrLeu IleArgTyr AsnAlaLys HisGluThrLeu 

24 64 GACGTGCCCG GCACCGAGAG CCTGTGGCCC CTGAGCGTGG AGAGCCCCAT CGGCCGCTGC 
AspValPro GlyThrGlu SerLeuTrpPro LeuSerVal GluSerPro IleGlyAxgCys 

2524 GGCGAGCCCA ACCGCTGCGC CCCCCACTTC GAGTGGAACC CCGACCTGGA CTGCAGCTGC 
GlyGluPro AsnArgCys AlaProHisPhe GluTrpAsn ProAspLeu AspCysSerCys 

2584 CGCGACGGCG AGAAGTGCGC CCACCACAGC CACCACTTCA GCCTGGACAT CGACGTGGGC 
ArgAspGly GluLysCys AJLaHisHisSer HisHisPhe SerLeuAsp IleAspValGly 

264 4 TGCACCGACC TGCACGAGAA CCTGGGCGTG TGGGTGGTGT TCAAGATCAA GACCCAGGAG 
CysThrAsp LeuHisGlu AsnLeuGlyVal TrpValval PheLysIle LysThrGlnGlu 

2704 GGCCACGCCC GCCTGGGCAA CCTGGAGTTC ATCGAGGAGA AGCCCCTGCT GGGCGAGGCC 
GlyHisAla ArgLeuGly AsnLeuGluPhe IleGluGlu LysProLeu LeuGlyGluAla 

27 64 CTGAGCCGCG TGAAGCGCGC CGAGAAGAAG TGGCGCGACA AGCGCGAGAA GCTGCAGCTG 
LeuSerArg ValLysArg AlaGluLysLys TrpArgAsp LysArgGlu LysLeuGlnLeu 

2824 GAGACCAAGC GCGTGTACAC CGAGGCCAAG GAGGCCGTGG ACGCCCTGTT CGTGGACAGC 
GluThrLys ArgValTyr ThrGluAlaLys GluAlaVal AspAlaieu PheValAspSer 

2884 CAGTACGACC GCCTGCAGGC CGACACCAAC ATCGGCATGA TCCACGCCGC CGACAAGCTG 
GlnTyrAsp ArgLeuGln AlaAspThrAsn IleGlyMet IleHisAla AlaAspLysLeu 

2944 GTGCACCGCA TCCGCGAGGC CTACCTGAGC GAGCTGCCCG TGATCCCCGG CG T GAACGCC 
Va^sS^IleArgGlu AlaTyrLeuSer GluLeuPro ValllePro GlyValAsnATa 

3004 GAGATCTTCG AGGAGCTGGA GGGCCACATC ATCACCGCCA TCAGCCTGTA CGACGCCCGC 
GluIlePhe GluGluLeu GluGlyHisIle IleThxAla IleSerLeu TyrAspAlaArg 
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3064 AACGTGGTGA AGAACGGCGA CTTCAACAAC GGCCTGACCT GCTGGAACGT GAAGGGCCAC 
A^alVal LysAsnGly AspPheAsnAsn GlyLeuThr CysTrpAsn ValLysGlyHis 

3124 GTGGACGTGC AGCAGAGCCA CCACCGCAGC GACC TGGTGA GgGGCCCK.. 
ValAspVal GlnGlnSer HisHisArgSer AspLeuVal IleProGlu TrpGluAlaGlu 

3184 GTGAGCCAGG CCGTGCGCGT GTGCCCCGGC TGCG^ACA TCCTGCGCGT GACCGCCTAC 
ValSerGln AlaValArg ValCysProGly CysGlyTyr IleLeuArg ValThrAlaTyr 

324 4 AAGGAGGGCT ACGGCGAGGG CTGCGTGACC ATCCACGAGA TCGAGAACAA CACCGACGAG 
SJSSuSy Ty rGlyGlu GlyCysValThr IleHisGlu IleGluAsn AsnThrAspGlu 

330 4 CTGAAGTTCA AGAACCGCGA GGAGGAGGAG GTGTACCCCA CCGACACCGG CACCTGCAAC 
^euIysPhe LysAsrxArg GluGluGluGlu ValTyrPro ThrAspThr GlyThrCysAsn 

3364 GACTACACCG CCCACCAGGG CACCGCCGGC TGCGCCGACG CCTGGAACAG CCGCAACGCC 
SpTyrThr AlaHisGln GlyThrAlaGly CysAlaAsp AlaCysAsn SerArgAsnAla 

34 24 GGCTACGAGG ACGCCTACGA GGTGGACACC ACCGCCAGCG TGAACTACAA GCCCACCTAC 
SyJyrSu AspAlaTyr GluValAspThr TiurAlaSer ValAsnTyr LyaProTHrTyr 

-*a*a r»r^ia-arA CC T ACACCGA CGTGCGCCGC GACAAC CACT GCGAGTACGA CCGCGGCTAC 
3 84 T S™SS^spValAxgArg AspAsnHls CysGluTyr AspArgGlyTyr 

pTrAAPTACC CCCCCGTGCC CGCCGGCTAC GTGACCAAGG AGCTGGAGTA CTTCCCCGAG 
3 SaSiSJr C p5oSo?IrPr5AlaGlyTyr ValThrLys GluLeuGlu TyrPheProGlu 

*rrr*r*rCG TGTGGATCGA GATCGGCGAG ACCGAGGGCA AGTTCATCGT GGACAGCGTG 
ihrS^S G vIS?p1Ie^lu?ieGlyGlu ThrGluGly LysPhelle ValAspSerVal 

3664 GAGCTGCTGC TGATGGAGGA GTAG 
GluLeuLeu LeuJWetGlu Glu 
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SEQUENCE OF THE FULL-LENGTH HYBRID SYNTHETIC /NATIVE CRYIA(B) CHIMERIC GEHE 
The fusion point between the synthetic and native coding sequences is 
indicated by a slash (/) in the sequence . 

1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
VaLAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CAT CGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheAxgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThrThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

4 81 TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG AC GTCAGCGT GTTCGGCCAG 
TyrValGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer ValPheGlyGLn 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAspAla AlaThrlleAsn SerArgTyr AsnAspLeu ThrArgLeuIle 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGT 
GlyAsnTyr ThrAspHis AlaValArgTrp TyrAsnThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTC 
ProAspSer ArgAspTrp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVal 

* 721 CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTC 
LeuAspIle ValSerLeu PbeProAsnTyr AspSerArg ThrTyrPro IleArgThrVal 

\ 781 AGCCAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
SerGlnLeu ThrArgGlu IleTyrThrAsn ProValLeu GluAsnPhe AspGlySerPne 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MetAspIleLeu 

901 AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
AsnSerlle ThrlleTyr ThrAspAlaHis ArgGlyGlu TyrTyrTrp SerGlyHisGln 
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961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
^IScSla SerProVal GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 

1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC J<^CAGGG JGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnAxglle ValAlaGln LeuGlyGln GlyValTyrArg 

iofli irrrTrsrXA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
1081 S2SS?s2S§SS SyrSgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

l?ni TArrGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
%£S£!? s2SS55£r ValSpSerLeu AspGluIle ProProGln AsnAsnAsnVal 

rrACTTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
pSproSS cSSJ?S«rSirtrgL«u SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA JCTG^TTCA CCGCAGTOCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgserAia 

PACTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
1381 SSJSSS^alSISIS ProSerSerGln HeThrGln IleProLeu ThrLysSerThr 

i/i-i mrrraaCT* GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
1441 SS2SSfsSS5iS?1e;SalV.lLys GlyProGly PheThrGly GlyAspIleLeu 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
ArgArgThr SerProGly GlnlleSerThr LeuArgVal AsnlleThr AlaProLeuSer 

i CC i n ir ^nrr GCGTCCGCAT ccgctacgcc AGCACCACCA ACCTGCAGTT CCACACCAGC 
1561 SSJSSS ASvISSTllSr^yrAla SerThrThr Asnl^uGln PheHisThxSer 

A-rrrACGGCC GCCCCATCAA CCAGGGCAAC T TCAGCGCCA CCATGAGCAG CGGCAGCAAC 

A^SSIrAsnGlnGlyAsn PheSerAla ThrMetSer SerGlySerAsa 

CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
SSS3Sf C?3Sl» ArgThrValGly PheTbrThr ProPheAsn PheSerAsnGly 

Anri^rrrT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC 
T pSSS2^erSaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 

rnr*TCKAn TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
SSS^pSSSS^SSStttta PheGluAla GluTyrAsp LeuGluArgAla 

rirlir ™ r TGAACGAGCT GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 
SSSSSf vSSSS^IeuPheThrSer SerAsnGln IleGlyl^u LysThrAspVal 
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* 1921 ACCGACTACC ACATCGAT / CA AGTATCCAAT TTAGTTGAGT GTTTATCTGATGAATTT TGT 
ThrAspTyr HisIleAsp/GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

' 1981 CTGGATGAAA AAAAAGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 

2041 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAACTAGA CCGTGGCTGG 

ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn AxgGlnLeu AspArgGlyTrp 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TAT TCAAAGA GAATTACGTT 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVal 

2161 ACGCTATTGG GTACCTTTGA TGAGTGCTAT CCAACGTATT TATATCAAAA AATAGATGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGlu 

2221 TCGAAATTAA AAGCCTATAC CCGTTACCAA TTAAGAGGGT ATATCGAAGA TAGTCAAGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuAjrgGly TyrlleGlu AspSerGlnAsp 

2281 TTAGAAATCT ATTTAATTCG CTACAATGCC AAACACGAAA CAGTAAATGT GCCAGGTACG 
LeuGluIle TyrLeuIle ArgTyrAsnAJLa LysHisGlu ThrValAsn ValProGlyThr 

2341 GGTTCCTTAT GGCCGCTTTC AGCCCCAAGT CCAATCGGAA AATGTGCCCA TCATTCCCAT 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysAla HisHisSerHxs 

24 01 CATTTCTCCT TGGACATTGA TGTTGGATGT ACAGACTTAA ATGAGGACTT AGGTGTATGG 
HisPheSer LeuAspIle AspValGlyCys ThrAspLeu AsnGluAsp LeuGlyValTrp 

2 4 61 GTGATATTCA AGATTAAGAC GCAAGATGGC CATGCAAGAC TAGGAAATCT AGAATTTCTC 
ValllePhe LysIleLys ThrGlnAspGly HisAJLaArg LeuGlyAsn LeuGluPheLeu 

2521 GAAGAGAAAC CATTAGTAGG AGAAGCACTA GCTCGTGTGA AAAGAGCGGA GAAAAAATGG 
GluGluLys ProLeuVal GlyGluAlaLeu AlaArgVal LysArgAla GluLysLysTrp 

2581 AGAGACAAAC GTGAAAAATT GGAATGGGAA ACAAAT AT T G TTTATAAAGA GGCAAAAGAA 
ArgAspLys ArgGluLys LeuGluTrpGlu ThrAsnlle ValTyrLys GluAlaLysGlu 

2 641 TCTGTAGATG CTTTATTTGT AAACTCTCAA TATGATAGAT TACAAGCGGA TACCAACATC 
SerValAsp AJLaLeuPhe ValAsnSerGln TyrAspArg LeuGlnAla AspThxAsnlle 

27 01 GCGATGATTC ATGCGGCAGA TAAACGCGTT CATAGCATTC GAGAAGCTTA TCTGCCTGAG 
\ AlaMetlle HisAlaAla AspLysArgVal HisSerlle ArgGluAla Tyrl^uProGlu 

2761 CTGTCTGTGA TTCCGGGTGT CAATGCGGCT ATTTTTGAAG AATTAGAAGG GCGTATTTTC 
LeuSerVal IleProGly ValAsnAlaAla IlePheGlu GluLeuGlu GlyArgllePhe 

2821 ACTGCATTCT CCCTATATGA TGCGAGAAAT GTCATTAAAA ATGGTGATTT TAATAATGGC 
ThrAlaPbe SerLeuTyr AspAlaArgAsn VallleLys AsnGlyAsp PheAsnAsnGly 
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TTATrrTGCT GGAACGTGAA AGGGCATGTA GATGTAGAAG AACAAAACAA CCACCGTTCG 

T^vIrtysGlyHisVal AspValGlu GluGlnAsn AsnHisArgSer 

GTCCTTGTTG TTCCGGAATG GGAAGCAGAA GTGTCACAAG AAGTTCGTGT CTGTCCGGGT 
ValLeuVal ValProGlu TrpGluAlaGlu ValSerGln GluValArg ValCysProGly 

CGTGGCTATA TCCTTCGTGT CACAGCGTAC AAGGAGGGAT ATGttGUIGG "^TAACC 
ArgGlyTyr HeLeuArg ValTlirAlaTyr LysGluGly TyrGlyGlu GlyCysValTbr 

ATTCATGAGA TCGAGAACAA TACAGACGAA CTGAAGTTTA GCAACTGTGT AGAAGAGGAA 
SKsGlu IleGluAsn AsnThxAspGlu LeuLysPhe SerAsnCys ValGluGluGlu 

GTATATCCAA ACAACACGGT AACGTGTAAT GATTATACTG C^CTCAAGA JGAATATGAG 
ValTyrPro AsnAsnThr ValThrCysAsn AspTyrThr AlaThrGln GluGluTyrGlu 

GGTACGTACA CTTCTCGTAA TCGAGGATAT GACGGAGCCT ATGAAAGCAA TTCTTCTGTA 
^^S^xSSe^g^snAcgGlyTyr AspGlyAla TyrGluSer AsnSerSerVal 

CCAGCTGATT ATGCATCAGC CTATGAAGAA AAAGCATATA CAGATGGACG AAGAGACAAT 
S£a2p Ty^AiaSer AlaTyrGluGlu LysAIaTyr ThrAspGly AxgArgAspAsn 

rrTTPTRAAT CTAACAGAGG ATATGGGGAT TACACACCAC TACCAGCTGG CTATGTGACA 
£oC?ISS Se^SS^l^yrGlyAsp TyrThrPro LeuProAla GlyTyrValThx 

AAAGAATTAG AGTACTTCCC AGAAACC GAT AAGGTATGGA TTGAGATC(^ AG^CGG^ 
Ly sGluLeu GluTyrPhe ProGluThrAsp LysValTrp IleGluIle GlyGluThrGlu 



3421 GGAACATTCA TCGTGGACAG CGTGGAATTA CTTCTTATGG AGGAATAA 
GlyThrPhe IleValAsp SerValGluLeu LeuLeuMet GluGlu 
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1 ATGGACAACA ACCCCAACAT CAACGAGTGC 
MetAspAsn AsnProAsn IleAsnGluCys 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG 
ValGluVal LeuGlyGly GluArglleGlu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC 
SerLeuThr GlnPheLeu LeuSerGluPhe 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC 
ValAspIle IleTrpGly IlePheGlyPro 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG 
GluGlnLeu IleAsnGln ArglleGluGlu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr 

361 CCCACCAACC CCGCCCTGCG CGAGGAG AT G 
ProThrAsn ProAlaLeu ArgGluGluMet 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG 
LeuThrThr AlallePro LeuPheAlaVal 

4 81 TACGTGCAGG CCGCCAACCT GCACCTGAGC 
TyrValGln AlaAlaAsn LeuHi sLeuSer 

54 1 CGCTGGGGCT TCGACGCCGC CACCATCAAC 
ArgTrpGly PheAspAla AlaThrlleAsn 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG 
GlyAsnTyr ThrAspHis AlaValArgTrp 

6 61 CCCGACAGCC GCGACTGGAT CAGGTACAAC 

ProAspSer ArgAspTrp IleArgTyrAsn 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC 

LeuAspIle ValSerLeu PheProAsnTyr 

7 81 AGCCAGCTGA CCCGCGAGAT TTACACCAAC 

SerGlnLeu ThrArgGlu IleTyrThrAsn 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC 
ArgGlySer AlaGlnGly IleGluGlySer 

901 AACAGCATCA CCATCTACAC CGACGCCCAC 
AsnSerlle ThrlleTyr ThrAspAlaHis 

961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC 
IleMetAla SerProVal GlyPheSerGly 



ATCCCCTACA ACTGCCTGAG CAACCCCGAC- 
IleProTyr AsnCysLeu SerAsnProGiu 

ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ThrGlyTyr ThrProIle AspIleSerLeu 

GTGCCCGGCG CCGGCTTCGT GCTGGGCCT3 
ValProGly AlaGlyPhe ValLeuGlyLeu 

AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
SerGlnTrp AspAlaPhe LeuValGlnlle 

TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
PheAlaArg AsnGlnAla I leSerArgLeu 

GCCGAGAGCT TCCGCGAGTG GGAGGCCGA: 
AlaGluSer PheArgGlu TrpGluAlaAsp 

CGCATCCAGT TCAACGACAT GAAC AGCGC Z 
ArglleGln PheAsnAsp MetAsnSerAla 

CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
GlnAsnTyr GlnValPro LeuLeuSerVal 

GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
ValLeuArg AspValSer Va lPheGlyGlr. 

AGCCGCTACA ACGACCTGAC CCGCCTGATC 
SerArgTyr AsnAspLeu ThrArgLeuIle 

TACAACACCG GCCTGGAGCG CGTGTGGGGT 
TyrAsnThr GlyLeuGlu ArgValTrpGly 

CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
GlnPheArg ArgGluLeu ThrLeuThrVal 

GACAGCCGCA CCTACCCCAT CCGCACCGTG 
AspSerArg ThrTyrPro IleArgThrVal 

CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
ProValLeu GluAsnPhe AspGlySerPhe 

ATCCGCAGCC CCCACCTGAT GGACATCCTG 
IleArgSer ProHisLeu MetAspIleLeu 

CGCGGCGAGT ACTACTGGAG CGGCCAGCAG 
ArgGlyGlu TyrTyrTrp SerGlyHisGln 

CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
ProGluPhe ThrPhePro LeuTyrGlyThr 
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1C21 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

* 1CS1 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

114 1 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

12 01 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe ■ SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAla 

1 2=1 GAGT TCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC GAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln IleThrGln IleProLeu ThrLysSerThr 

14 41 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

15C1 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
ArgArgThr SerPrcGly GlnlleSerThr LeuArgVai AsnlleThr AlaProLeuSer 

1561 CAGCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAla SerThrThr AsnLeuGln PheHisThrSer 

1621 ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
IleAspGly ArgProIle AsnGlnGlyAsn PheSerAla ThrMetSer SerGlySerAsn 

1681 CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
LeuGlnSer GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

17 41 AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC 

SerSerVal PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 

1801 CGCATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
- ArglleGlu PheValPro AlaGluValThr PheGluAla GluTyrAsp LeuGluArgAla 

18 61 CAGAAGGCCG TGAACGAGCT GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 

GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 ACCGACTACC ACATCGATCA AGTATCCAAT TTAGTTGAGT GTTTATCTGA TGAATTTTGT 
ThrAspTyr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 CTGGATGAAA AAAAAGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 
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' 2C41 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAAC TAG A CCGTGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

2-01 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGTT 
ArgGlySer ThrAspIl'e ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVa. 

2'€1 ACGCTATTGG GTACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA GATCGACGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGI- 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

22 81 CTGGAAATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCACC 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrValAsn ValProGlyThr 

GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC CCCATCGGCA AGTGCGGGGA GCCGAATCGA 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysGly GluProAsnArg 

TGCGCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAA3 
CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

24 61 TGCGCCCACC ACAGCCACCA CTTCAGCCTG GACATCGACG TGGGCTGCAC CGACCTGAAC 
CysAlaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys ThrAspLeuAsr. 

2*21 GAGGACCTGG GCGTGTGGGT GATCTTCAAG ATCAAGACCC AGGACGGCCA CGCCCGCCTG 
GluAspLeu GlyValTrp VallleFheLys IleLysThr GlnAspGly HisAlaArgLe- 

2 C 8 1 GGCAATCTAG AATTTCTCGA AGAGAAACCA TTAGTAGGAG AAGCACTAGC TCGTGTGAAA 
GlyAsnLeu GluPheLeu GluGluLysPro LeuValGly GluAlaLeu AlaArgValLys 

2 641 AGAGCGGAGA AAAAATGGAG AGACAAACGT GAAAAATTGG AATGGGAAAC AAATATTGT? 
ArgAlaGlu LysLysTrp ArgAspLysArg GluLysLeu GluTrpGlu ThrAsnlleVa^ 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCT TTATTTGTAA ACTCTCAATA TGATAGATTA 
Ty rLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGln TyrAspArgLeu 

27 61 CAAGCGGATA CCAACATCGC GATGATTCAT GCGGCAGATA AACGCGTTCA TAGCATTCGA 

GlnAlaAsp ThrAsnlle AlaMetlleHis AlaAlaAsp LysArgVal HisSerlleArg 

2821 GAAGCTTATC TGCCTGAGCT GTCTGTGATT CCGGGTGTCA ATGCGGCTAT TTTTGAAGAA 
■ GluAlaTyr LeuProGlu LeuSerVallle ProGlyVal AsnAlaAla IlePheGluGlu 

2881 TTAGAAGGGC GTATTTTCAC TGCATTCTCC ^TATATGATG CGAGAAATGT CATTAAAAAT 
LeuGluGly ArgllePhe ThrAlaPheSer LeuTyrAsp AlaArgAsn VallleLysAsn 

9Q4 1 CGTGATTTTA ATAATGGCTT ATCCTGCTGG AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
G^AspSe Ai^nGly LeuSerCysTrp AsnValLys GlyHisVal AspValGluGlu 

3001 CAAAACAACC ACCGTTCGGT CCTTGTTGTT CCGGAATGGG AAGCAGAAGT GTCACAAGAA 
GlnAsnAsn HisArgSer ValLeuValVal ProGluTrp GluAlaGlu ValSerGlnGlu 
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30 61 GTTCGTGTCT GTCCGGGTCG TGGCTATATC CTTCGTGTCA CAGCGTACAA GGAGGGATAT 
ValArgVal CysProGly ArgGlyTy r I le LeuArgVal ThrAlaTyr LysGluGlyTyr 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGly CysValThr IleHisGluIle GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA TTATACTGCG 
AsnCysVal GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrA.a 

3241 ACTCAAGAAG AATATGAGGG TACGTACACT TCTCGTAATC GAGGATATGA CGGAGCCTAT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3301 GAAAGCAATT CTTCTGTACC AGCTGATTAT GCATCAGCCT ATGAAGAAAA AGCATATACA 
GluSerAsn SerSerVal ProAlaAspTyr AlaSerAla TyrGluGlu Ly sAlaTy rThr 

^361 GATGGACGAA GAGACAATCC TTGTGAATCT AACAGAGGAT ATGGGGATTA CACACCACTA 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyAsp TyrThrProLeu 

-4 21 CCAGCTGGCT ATGTGACAAA AGAATTAGAG TACTTCCCAG AAACCGATAA GGTATGGATT 
ProAlaGly TyrValThr LysGluLeuGlu TyrPhePro GluThrAsp LysValTrpI.e 

34 81 GAGATCGGAG AAACGGAAGG AACATTCATC GTGGACAGCG TGGAATTACT TCTTATGGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAspSer ValGluLeu LeuLeuMetGlu 

3541 GAATAA 
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1 ATGGACAACA ACCCCAACAT CAACGAGTGC 
MetAspAsn AsnProAsn IleAsnGluCys 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG 
ValGluVal LeuGlyGly GluArglleGlu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC 
SerLeuThr GlnPheLeu LeuSerGluPhe 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC 
ValAspIle IleTrpGly IlePheGlyPro 

2 41 GAGCAGCTGA TCAACCAGCG * C ATCGAGGAG 
GluGlnLeu IleAsnGln ArglleGluGlu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr 

2 61 CCCACCAACC CCGCCCTGCG CGAGGAGATG 
ProThrAsn ProAlaLeu ArgGluGluMet 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG 
LeuThrThr AlallePro LeuPheAlaVal 

4 81 TACGTGCAGG CCGCCAACCT GCACCTGAGC 
TyrValGln AlaAlaAsn LeuHisLeuSer 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC 
ArgTrpGly PheAspAla AlaThrlleAsn 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG 
GlyAsnTyr ThrAspHis AlaValArgTrp 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC 
ProAspSer ArgAspTrp IleArgTyrAsn 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC 
LeuAspIle ValSerLeu PheProAsnTyr 

7 81 AGCCAGCTGA CCCGCGAGAT TTACACCAAC 
I SerGlnLeu ThrArgGlu HeTyrThrAsn 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC 
ArgGlySer AlaGlnGly IleGluGlySer 

901 AACAGCATCA CCATCTACAC CGACGCCCAC 
AsnSerlle ThrlleTyr ThrAspAlaHis 

9 61 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC 
IleMetAla SerPrpVal GlyPheSerGly 



ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
IleProTyr AsnCysLeu SerAsnProGl- 

ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ThrGlyTyr ThrProIle Aspl leSerLeu 

GTGCCCGGCG CCGGCTTCGT GCTGGGCCTC- 

ValProGly AlaGlyPhe ValLeuGlyLeu 

AGCCAGTGGG ACGCCTTCCT GGTGCAGAT: 
SerGlnTrp AspAlaPhe LeuValGlnlle 

TTCGCCCGCA ACCAGGCCAT CAGCCGCCT: 
PheAlaArg AsnGlnAla I leSerArgLeu 

GCCGAGAGCT TCCGCGAGTG GGAGGCCGA3 
AlaGluSer PheArgGlu TrpGluAlaAsp 

CGCATCCAGT TCAACGACAT GAACAGCGC: 

ArglleGln PheAsnAsp MetAsnSerAla 

CAGAACTACC AGGTGCCCCT GCTGAGCGT3 
GlnAsnTyr GlnValPro LeuLeuSerVa. 

GTGCTGCGCG ACGTCAGCGT GTTCGGCCA3 
ValLeuArg AspValSer VaiPheGlyGl:. 

AGCCGCTACA ACGACCTGAC CCGCCTGATI 
SerArgTyr AsnAspLeu ThrArgLeuIle 

TACAACACCG GCCTGGAGCG CGTGTGGGGT 
TyrAsnThr GlyLeuGlu ArgValTrpGly 

CAGTTCCGCC GCGAGCTGAC CCTGACCGT3 
GlnPheArg ArgGluLeu ThrLeuThrVal 

GACAGCCGCA CC TACCCCAT CCGCACCGTG 
AspSerArg ThrTyrPrp IleArgThrVal 

CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
ProValLeu GluAsnPhe AspGlySerPhe 

ATCCGCAGCC CCCACCTGAT GGACATCCTG 
IleArgSer ProHisLeu MetAspIleLeu 

CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
ArgGlyGlu TyrTyrTrp SerGlyHisGln 

CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
ProGluPhe ThrPhePro LeuTyrGlyThr 
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1 1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG JJTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVai 



1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

12 61 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAia 

1361 GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln IleThrGln IleProLeu ThrLysSerThr 

^4 41 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AlSeuGly SerGlyThr SerValValLys GlyProGly PheThrGly. GlyAspIleLeu 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ^^ATCACCGC CCCCCTGAGC 
ArgArgThr SerProGly GlnlleSerThr LeuArgVal AsnlxeThr AlaProLeuSe. 

•561 CAGCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAla SerThrThr AsnLeuGln PheHisThrSer 

1621 ATCGACGGCC GCCCCATCAA C CAGGGC AAC TTCAGCGCCA CCATGAGCAG ^GCAGCAAC 
IleAspGly ArgProIle AsnGlnGlyAsn PheSerAla ThrMetSer SerGiyserAsn 

1681 CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
2uSns5r GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

17 41 AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT ^ACATCGAC 
SerSerVal PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 

mm rrrATrCAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
: ArgneSS pSeSIlPro MaGluValThr PheGluAla GluTyrAsp LeuGluArgAla 

1861 CAGAAGGCCG TGAACGAGCT GTTCACCAGC A ^^^^p*^^ T TgQ?y^y^^ V gYhrA^pVa? 
GlnLysAla valAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspval 

1Q21 ACCGACTACC ACATCGATCA GGTGAGCAAC CTGGTGGAGT GCTTAAGCGA CGAGTTCTGC 
SSlp^r HisIleLp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

l0fi1 rTGGACGAGA AGAAGGAGCT GAGCGAGAAG GTGAAGCACG CCAAGCGCCT GAGCGACGAG 
SSS^iS \£s£ysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 
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* 2041 CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCATCAACC GCCAGCTGGA CCGCGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

2101 CGAGGCAGCA CCGATATCAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAACTACGTG 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVax 

2161 ACCCTGCTGG GCACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA GATCGACGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGlu 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

22 81 CTGGAAATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCACC 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrValAsn ValProGlyThr 

2 341 GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC CCCATCGGCA AGTGCGGGGA GCCGAATCGA 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysGly GluProAsnArg 

Z4C- TGCGCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

2 4 61 TGCGCCCACC ACAGCCACCA CTTCAGCCTG GACATCGACG TGGGCTGCAC CGACCTGAAC 
CysAlaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys ThrAspLeuAsn 

2 521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG ATCAAGACCC AGGACGGCCA CGCCCGCCTG 
GluAspLeu GiyValTrp ValllePheLys IleLysThr Glr.AspGly HisAlaArgLeu 

2 581 GGCAATCTAG AATTTCTCGA AGAGAAACCA TTAGTAGGAG AAGCACTAGC TCGTGTGAAA 
GlyAsnLeu GluPheLeu GluGluLysPro LeuValGly GluAlaLeu AlaArgValLys 
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30 61 GTTCGTGTCT GTCCGGGTCG TGGCTATATC CTTCGTGTCA CAGCGTACAA GGAGGGATA7 
ValArgVal CysProGly ArgGlyTyrlle LeuArgVal ThrAlaTyr LysGluGlyTy r 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGly CysValThr IleHisGluIle GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA TTATACTGC3 
AsnCysVal GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrA-a 

32 41 ACTCAAGAAG AATATGAGGG TACGTACACT TCTCGTAATC GAGGATATGA CGGAGCCTAT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3 301 GAAAGCAATT CTTCTGTACC AGCTGATTAT GCATCAGCCT ATGAAGAAAA AGCATATACA 
GluSerAsn SerSerVal ProAlaAspTyr AlaSerAla TyrGluGlu LysAlaTy rThr 

^361 GATGGACGAA GAGACAATCC TTGTGAATCT AACAGAGGAT ATGGGGATTA CACACCACTA 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyAsp TyrThrProLeu 

34 21 CCAGCTGGCT ATGTGACAAA AGAATTAGAG TACTTCCCAG AAACCGATAA GGTATGGATT 
ProAlaGly TyrValThr LysGluLeuGlu TyrPhePro GluThrAsp LysValTrpIle 

34 81 GAGATCGGAG AAACGGAAGG AACATTCATC GTGGACAGCG TGGAATTACT TCTTATGGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAspSer ValGluLeu LeuLeuMetGlu 
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1 je^^^ls^ s^^s^-sssss 

^rrr»riT TTAPACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
AGCCAGCTGA CCCGCGAGAT TTACACOA/U- ^ GluAsnPhe AspGlySerPhe 

SerGlnLeu ThrArgGlu HeTyrThrAsn ProValLeu GluAsnrne asrv, y 

„„ rTSP1( , rrAPCCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
AACAGCATCA CCATCTACAC CGACGCCCAU ^r?vriu TvrTvrTrp SerGlyHisGln 
AsnSerlle ThrlleTyr ThrAspAlaHis ArgGlyGlu TyrTyrrrp *e y 
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1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGG6CCJGG6 AGTJTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC <^™ CA .?S!iSSS C SrSSSSSIl 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

^201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
™rArg£ys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

12 61 CCACCTCGAC AGGGCTTCAG CCACCGTCTG ^^FsSSlS!!? j^SjEKe 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe Argi>erbiyr ne 

-321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAla 

1351 GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln IleThrGln IleProLeu ThrLysSerTh. 

14 41 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

»" SSS^f sSf^ S^ISS^ c r SSKKSS 

-.621 ATCCACGGCC GCCCCATCAA CCAGGGCAAC "« CC J«\CCATGAGCAG 

IleAspGly ArgProIle AsnGlnGlyAsn pheSerAla ThrMetSer serGlyserasr. 

Arg?leG^u^PheValPro'*Ala^uValThr' pheG^uA^a^G*uTyrA*p^*LeuG?uArgAla 

ssifssrs ssssf-sssr » 
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2 041 CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCATCAACC GCCAGCTGGA CCGCGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPbeArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

* 2101 CGAGGCAGCA CCGATATCAC CATCCAGGGC GGGGACGACG TGTTCAAGGA GAACTACGTG 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVai 

2161 ACCCTGCTGG GCACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA GATCGACGA3 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGlu 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

22 81 CTGGAAATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCACC 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrValAsn ValProGlyThr 

2 341 GGCAGCGTGT GGCCCCTGAG CGCCCCCAGC CCCATCGGCA AGTGCGGGGA GCCGAATCGA 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysGly GluProAsnArg 

24 01 TGCGCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

24 61 TGCGCCCACC ACAGCCACCA CTTCAGCCTG GACATCGACG TGGGCTGCAC GGACCTGAAC 
CysAlaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys ThrAspLeuAsn 

2521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG ATCAAGACCC AGGACGGCCA CGCCCGCCT3 
GluAspLeu GlyValTrp ValllePheLys IleLysThr GlnAspGly HisAlaArgLeu 

2581 GGCAATCTAG AGTTCCTGGA GGAGAAGCCC CTGGTGGGCG AGGCCCTGGC CCGCGTGAAG 
GlyAsnLeu GluPheLeu GluGluLysPro LeuValGly GluAtaLeu AlaArgValLys 

2 641 CGCGCCGAGA AGAAGTGGCG CGACAAGCGC GAGAAGCTGG -AGTGGGAGAC CAACATCGTG 
ArgAlaGlu LysLysTrp ArgAspLysArg GluLysLeu GluTrpGlu ThrAsnlleVal 

27 01 TACAAGGAGG CCAAGGAGAG CGTGGACGCC CTGTTCGTGA ACAGCCAGTA CGACCGCCTG 
TyrLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGln TyrAspArgLeu 

27 61 CAGGCCGACA CCAACATCGC CATGATCCAC GCCGCCGACA AGCGCGTGCA CAGCATTCGC 
GlnAlaAsp ThrAsnlle AlaMetlleHis AlaAlaAsp LysArgVal HisSerlleArg 

2 821 GAGGCCTACC TGCCCGAGCT GAGCGTGATC CCCGGCGTGA ACGCCGCCAT CTTCGAGGAA 
* GluAlaTyr LeuProGlu LeuSerVallle ProGlyVal AsnAlaAla IlePheGluGlu 

2 881 CTCGAGGGCC GCATCTTCAC CGCCTTCAGC CTGTACGACG CCCGCAACGT GATCAAGAAC 
LeuGluGly ArgllePhe ThrAlaPheSer LeuTyrAsp AlaArgAsn VallleLysAsn 

2 941 GGCGACTTCA ACAACGGCCT GAGCTGCTGG AACGTGAAGG GCCACGTGGA CGTGGAGGAG 
GlyAspPhe AsnAsnGly LeuSerCysTrp AsnValLys GlyHisVal AspValGluGlu 

3001 CAGAACAACC ACCGCAGCGT GCTGGTGGTG CCCGAGTGGG AGGCCGAGGT GAGCCAGGAG 
GlnAsnAsn HisArgSer ValLeuValVal ProGluTrp GluAlaGlu ValSerGlnGlu 
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30 61 GTGCGCGTGT GCCCCGGCCG CGGCTACATC 
ValArgVal CysProGly ArgGlyTyrlle 

3121 GGCGAGGGCT GCGTGACCAT CCACGAGATC 
GlyGluGly CysValThr IleHisGluIle 

3181 AACTGCGTGG AGGAGGAGGT GTACCCCAAC 
AsnCysVal GluGluGlu ValTyrProAsn 

3241 ACCCAGGAGG AGTACGAGGG CACCTACACC 
ThrGlnGlu GluTyrGlu GlyThrTyrThr 

3 301 GAGAGCAACA GCAGCGTGCC CGCCGACTAC 
GluSerAsn SerSerVal ProAlaAspTyr 

3 3 61 GACGGCCGCC GCGACAACCC CTGCGAGAGC 
AspGlyArg ArgAspAsn ProCysGluSer 

34 21 CCCGCCGGCT ACGTGACCAA GGAGCTGGAG 
ProAlaGly TyrValThr LysGluLeuGlu 

34 81 GAGATCGGCG AGACCGAGGG CACCTTCATC 

GluIleGly GluThrGlu GlyThrPhelle 

35 41 GAGTAG 



CTGCGCGTGA CCGCCTACAA GGAGGGCTAC 
LeuArgVal ThrAlaTyr LysGluGlyTyr 

GAGAACAACA CCGACGAGCT CAAGTTCAGC 
GluAsnAsn ThrAspGlu LeuLysPheSer 

AACACCGTGA CCTGCAACGA CTACACCGCG 
AsnThrVal ThrCysAsn AspTy rThrAla 

AGCCGCAACC GCGGCTACGA CGGCGCCTAC 
SerArgAsn ArgGlyTyr AspGlyAlaTyr 

GCCAGCGCCT ACGAGGAGAA GGCCTACACC 
AlaSerAla TyrGluGlu LysAlaTyrThr 

AACCGCGGCT ACGGCGACTA CACCCCCCTG 
AsnArgGly TyrGlyAsp TyrThrProLeu 

TACTTCCCCG AGACCGACAA GGTGTGGATC 
TyrPhePro GluThrAsp LysValTrpIle 

GTGGACAGCG TGGAGCTGCT GCTGATGGAG 
ValAspSer ValGluLeu LeuLeuMetGlu 
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1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA JCTGCCTGAG CAACCCCGA3 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProG.. 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLe, 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCT3 
SeSShr GlnPheLeo LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAspHe IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ^CAGGCCAT CAGCCGCCT3 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAla IleSerArgLe- 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT JCCGCGAGTG GGAGGCCGA: 
GluGlyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaA_._ 

36- CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC 

Pro^n ProSaieu ArgGluGluMet ArglleGln PheAsnAsp HetAsnSerA.a 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGMCTACC AGGTGCCCCT GCTGAGCGTo 
LeuThrThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVa. 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTCAGCGT GTTCGGCCA3 
TyrValGlr. AlaAlaAsn LeuHisLeuSer Va.leuArg AspvalSer Va.PheuxyG... 

,,. rrr „ rrrrCT tcgacgcCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 

A?g^pGly pSaspA^ AlaThrlleAsn SerArgTyr AsnAspLeu ThrArgLeuIle 

721 CTGGACATCG TGAGCCTGTT CCGCAACTAC g^^ggS^^g^gSSS 
LeuAspIle ValSerLeu PheProAsnTyr AspSerArg ThrTyrPro neArgin 

o i irrrirrTfA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
781 S?SSS?TSSSS^n«TyrThrA.n ProValLeu GluAsnPhe AspGlySerPhe 

rrrrcc AGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
ArgSySer A^aSnG^ IleGluGlySer IleArgSer ProHisLeu MetAspIleLeu 
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1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC J^GCCAGGG AGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrAr, 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT n C ^ C ?TS^?ST^^sSGln§lS" 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLe- 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVa. 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
1201 TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVa. 

12 61 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
Pr^ProArJ GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA -TATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAlo 

TAGTTCAACA ACATCATCCC CAGCAGCCAG AICACCCAGA TCCCCCTGAC C AAGAGCAC Z 

" SupKaIo \innene ProSerSerGln IleThrGln IleProLeu ThrLysSerThr 

1441 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAsplleLeu 

15C1 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC ^ CCCCCTGAGC 
ArgArgThr SerProGly Glr.I leSerThr Le^ArgVal Asnl.eThr «. a F-oLeuSe. 

rACCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
SnArg^f ArgSaSg IleArgTyrAla SerThrThr AsnLeuGln PheH.sThrSer 

1621 ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
IleAspGly ArgProIle AsnGlnGlyAsn PheSerAla ThrMetSer SerGlySerAs.. 

rTCClCACCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
SuSSer GlJserSe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT ^ACATCGAC 
SerSerVal PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 



1681 



1741 



1801 

't 

? 



1861 

> 



1921 



1981 



CGCATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
SgneSS PheValPro AlaGluValThr PheGluAla GluTytAsp LeuGluArgAla 

CAGAAGGCCG TGAACGAGCT GTTCACCAGC AGCAACCAGA JCGGCCTGAA GACCGACGTG 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

ArrrAPTACC ACATCGATCA AGTATCCAAT TTAGTTGAGT GTTTATCTGA TGAATTTTGT 
i£5Sp?JS HtsneSp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

TTCrATrAAA AAAAAGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
SSSpS^L^iJJSS LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 
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* 204 1 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAACTAGA CCGTGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

»* 2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGTT 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVa. 

2161 ACGCTATTGG GTACCTTTGA TGAGTGCTAT CCAACGTATT TATATCAAAA AATAGATGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGl^ 

2221 TCGAAATTAA AAGCCTATAC CCGTTACCAA TTAAGAGGGT ATATCGAAGA TAGTCAAGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

22 81 TTAGAAATCT ATTTAATTCG CTACAATGCC AAACACGAAA CAGTAAATGT GCCAGGTACG 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHi sGlu ThrValAsn ValProGlyThr 

2 341 GGTTCCTTAT GGCCGCTTTC AGCCCCAAGT CCAATCGGCA AGTGCGGGGA GCCGAATCGA 
2341 5^ Leu TrpProLeu SerAlaProSer ProlleGly LysCysGly GluProAsnArc 

2401 TGCGCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAA3 
CysAlaPro Hi sLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

24 61 TGCGCCCACC ACAGCCACCA CTTCAGCCTG GACATCGACG TGGGCTGCAC CGACCTGAAC 
CysAlaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys ThrAspLeuAsn 

2521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG ATCAAGACCC AGGACGGCCA JGCCCGCCTG 
- • -.a.spLe'- GlyValTrp Val I lePheLy s IleLysThr GlnAspGly rij.s rt .a rt rgLe_ 

, rrrAATCfAG AATTTCTCGA AGAGAAACCA TTAGTAGGAG AAGCACTAGC TCGTGTGAAA 
G^nLeu G^pKSeu GluGluLysPro LeuValGly GluAlaLeu AlaArgValLys 

2641 AGAGCGGAGA AAAAATGGAG AG AC AAACG T GAAAAATTGG ^GGGAAAC AAATATTGTT 
ArgAlaGlu LysLysTrp ArgAspLysArg GluLysLeu GluTrpGlu ThrAsnlieva. 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCT TTATTTGTAA ACTCTCAATA JGATAGATTA 
TyrLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGln TyrAspArgLeu 

2761 CAAGCGGATA CCAACATCGC GATGATTCAT GCGGCAGATA AACGCGTTCA ™GCATTCGA 
GlnAlaAsp ThrAsnlle AlaMetlleHis AlaAlaAsp LysArgVal HisSerlleArg 

->*->^ ffihrrTTlTP TrCCTGAGCT GTCTGTGATT CCGGGTGTCA ATGCGGCTAT TTTTGAAGAA 

i 28 LeSKoSS LSserVallle ProGlyVal AsnAlaAla IlePheGluGlu 

?flRl TTAGAAGGGC GTATTTTCAC TGCATTCTCC CTATATGATG CGAGAAATGT CATTAAAAAT 
, 28 leu^Sly ArgllePhe ThrAlaPheSer LeuTyrAsp AlaArgAsn VallleLysAsn 

' QJ1 rr TPATTTTA ATAATGGCTT ATCCTGCTGG AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
294 GlyAsp™A^Gl7LuSerCysT^ AsnValLys GlyHisVal AspValGluGlu 

"^01 CAAAACAACC ACCGTTCGGT CCTTGTTGTT CCGGAATGGG AAGCAGAAGT GTCACAAGAA 
30 G^SSn H^sArgSer ValLeuValVal ProGluTrp GluAlaGlu ValSerGlnGlu 
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™si GTTCGTGTCT gtccgggtcg tggctatatc cttcgtgtca cagcgtacaa ggagggatat 

3061 C ySroSy ArgGlyTyrlle LeuArgVal ThrAlaTyr LysGluGlyTyr 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGly CysValThr IleHisGluIle GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA TTATACTGCG 
AsnCysVal GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrAla 

32 41 ACTCAAGAAG AATATGAGGG TACGTACACT TCTCGTAATC GAGGATATGA CGGAGCCTAT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3 301 GAAAGCAATT CTTCTGTACC AGCTGATTAT GCATCAGCCT ATGAAGAAAA AGCATATACA 
GluSerAsn SerSerVal ProAlaAspTyr AlaSerAla TyrGluGlu LysAlaTyrThr 

3361 GATGGACGAA GAGACAATCC TTGTGAATCT AACAGAGGAT ATGGGGATTA CACACCACTA 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyASp TyrThrProLeu 

34 21 CCAGCTGGCT ATGTGACAAA AGAATTAGAG TACTTCCCAG AAACCGATAA GGTATGGAT7 
ProAlaGly TyrValThr LysGluLeuGlu TyrPhePro GluThrAsp LysValTrplie 

34 81 GAGATCGGAG AAACGGAAGG AACATTCATC GTGGACAGCG TGGAATTACT TCTTATGGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAspSer ValGluLeu LeuLeuMetGl-u 
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5808,BamHI 
5666,EcoRV 
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PCR fragment replacing Pith Nco I (ATG start) 
with Bam HI. Pith-5'-GGATCCAACA(ATG)-3'-syn.-Bt 
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5369-BamHI 

6232,XbaI r 
6226,SalI 
6220,PstI 
208,H»ndIII 

5730,SalI 




Pstl, 1603 
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Hr ylAfb) Protein Levels in Transgenic Maize 



ELISA Bt Values of Field Plants: 



INBRED X PARENT ABRU PLANT Number ng Bt/mg protein 



2ND01X171-4A 


1646 


5N984X171-4A 


857 


5N984X171-4A 


870 


5N984X171-13 


969 


5N984X171-15 


U68 


5N984X171-15 


1470 


5N984X171-14A 


1502 


5N984X171-14A 


1529 


5N984X176-1 1 


1667 


5N984X176-1 1 


1671 


5N984X176-1 1 


1673 


5N984X176-1 1 


16^5 


5N984X176-1 1 


1679 


5N984X171-4B 


1942 


5N984X171-4B 


1946 


5NA56X171-16ABX 


1 101 


5NA89X1 76-11 


1622 


5NA89X176-11 


1630 


5NA89X176-11 


1635 


6F010X171-4 


825 


6F010X171-4 


832 



29 
1705 
1760 

22 

17 

28 
180 
1500 
408 
1270 
1522 
943 
967 

15 

16 

30 
959 
1 172 
1 100 

103 
1298 



-Bt levels are in ng crylA(b) mg total protein 

-Data are from progeny of the described maize informants exoressing the crylA(b) 
protein. 

•ELISA analysis of transgenic plant material was carried out using standard procedures 
as described elsewhere. 
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p.ryiAfh ) Protein Le y^jnJ>ans nenic Maize . 



Greenhouse plants 



35S LINE 

6F010 x 171-4A 
5N984 x 171-14A 
6F010 x 171-16AB 
5N984 x 171-13 
5NA89 x 171-13 
5N984 x 171-18 
6N615 x 171-16AB 

PEPC LINE 

6N615 x 176-11 
6F010 x 176-10 
5N984 x 176-1 1 



-Bt levels are in ng crylA(b)/mg total protein 

< »ho M««=rrihPd maize Iransformants expressing the ay\Mb) 
Data are from progeny of the described maize 

protein. 

lor,, material was earned out using standard procedures 
ELISA analysis of transgenic plant material was cameo 

as described elsewhere. 
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EXPRESSION OF THE CRY.A(b) GENE IN TRANSGENIC MAIZE US.NG THE PITH- 
PREFERRED PROMOTER 

Leaf sanies Ton, «« P^™'- -^fTS^SS5 
European corn borer bioassay. 



expression. 



PLANT NUMBER ng cylA(b)/mg protein 

JS21A-1 TOP 169 

JS21A-2TOP 0 

JS21A-3TOP 1^3 

JS21A-1 1 TOP 127 

JS21A-12TOP 1 12 

JS21A-13TOP ? 7 

JS21A-14TOP 11 8 

JS21A-19TOP 82 

JS21A-24TOP 0 

JS21A-28TOP 154 

JS22D-3 MID 2946 

JS22D-4 MID 5590 

JS22D-1 1 M«D 

JS22D-17MID 3004 



fn 



66/94 



WO 93/07278 



PCT/US92/08476 
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; CATTAAAGAAGTCTTTGAACAGATTCTAGAGATCTAGTTTAATGAGCTC 



60 



1 GAATTCGGATCCATTAAAC^^^™-— ttAGGG CAGGGGTACTGTTATGTTTT 120 
61 CCAAAAGTCTTGAAAAAATTCAGCGGGGAG^ 180 
121 AAAGAGAACACCACTTTCTTGATCTCTTCTAAAGAUAAA 24Q 



361 
421 



-285 

1501 
-225 



480 
540 



481 CAAGCTTTTCCAATCTATCAGAAioi^~ ATATTTC CCTTCGACATTTATAT 600 

541 TATCGATGTTATCCATAAGGTATTTCTTGAACTTCTTATATTTO gg0 

601 TCCATCCTTTCAACATTTTTTTGTTCAATCTTTTT^ 720 

6 61 TACATTTCCTGCTCCTCACAGGTAAGGACGAGCTTTCAAAAAA QGGACAGAG 7 80 

7 21 AGGTCTGAGCCTCCAGCAAAGCTCACATATCTAAAGTCCCTCT^ ^ 
ill TGAGTGCTAAGACACATGGGAACATGACCAGAAAAAA g00 

8 4 1 AACAATATTCTTGTACTGCAAGTCTCGTTATGG^ g g Q 
901 CAAATGTGTTGGGATGTCAAGTATATAGACTAT^CATGA 1020 

9 61 GCAGCTCAATTATAGAGTTGAATAAAGTGCTCCA^Aii TTTAAA GTGGCTCGC 10 8C 
1021 GTT AAGAT ATTAAAATCACCTCCC AC CAACAT "^TGCA^^^^^^ ^ 

111! ^^^^^S«SSSESS5S. 28 

1261 CAGCCAGCCATGTGGCACTCCTACGTATACTACGTGAGGTGAGA^TTCACTCACA^GGGAT 1320 
-465 

1441 TGTTTCATTATCTTTTATCATATATATATAACACATATTAAATGATTCTTCGTTCCAATT 1500 



TATAATTCATTTGACTTTTTTATCCACCGATGCTCGTTTTATTAAAAAAAATATTATAAT 1560 

1561 TATTGTTACTTTTTGTTGTAATATTGTTTAGCATATAATAAACTTTGATACTAGTATGTT 1620 

1621 TCCGAGCAAAAAAAAATATTAATAT TTAGAT TACGA G^f^ftTTAATTA&XTATATTCG AG 1680 
-105 

CTTGCTATAAAC6AGGCAGCTAGACGCGACTCGACTCATCAGCCTCATCAACCTCGACGA 1800 

*^re&ffreAACGAACGGACAGGTTGTTGCACAGAAGCGACAT<^CTTTCGCGCCCAAAACGT l». 

+76 



38 
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22 81 GCGA 



E 



wCGGGCCCATCATCCAGGCGTCGGTGGCGCGG 



2341 GGGGTACCCTGCTCGGACCCCTACATCGACGGGCC;lailaiuuh^^uuiv>w* R 
G V P C S D P Y ID G P I l U 

2 4 01 GCTCTGGCCAGCGGC^CCACCATGGACGCCGTGCTG^A^i^x^™^*^-^ 
ALA SGTTMDAVLEMLRE - 



2521 



18 61 ri~n*r:nTCCTCCTCGCTGTCCTCGGCGTTGCAGGCAGCTCAG^CGCCGCCGCTGCTCCTGA 192 = 

+ 126 S S S S E S S ALQAAQSFfx, 

1921 GGCGGATGTCGTCGACCGCAACACCGAGACGGAGGTACGACGC GGCCGTCGTCGTCACTA 198C 

. +196 RMSSTATP RRRYDAAVVV 

1981 CCACCACCACTGCTAGAGCTGCGGCGGCTGCTGTCACGGTTCCCGCCGCCCCGCCGCAGG 2 0 4 C 

+256 T TTA-RAAAAAVTVFAA' 

2041 CGGGCCGCCGCCGCCGGTGCCACCAAAGCAAGCGGC 2100 

+ 316 G R R R R C H Q S K R R H P U 

2101 CGGTGTCGGACACCATGGCGGCGCTCATGGCCAAGGGCAAGGTTCGTATAGTACGCGCGC 

+ 376 VSDTMAALMAKGK 

2161 GTGTCGTCGTCGTTATTTTGCGCATAGGCGCGGACATACACGTGCTTTAGCTAGCT 

2221 LtAGATGATCGGTGCAGACGGCG™ 



2160 



222 C 
228C 



CGACGGCCGAGGCGCTGCGTCTGCTGGACGGCTGTGGCGCCGACGTCATCGAGCTG- 2 3 4C 

taealrlldgcgadvi 



2400 



246: 



gccgtgctggagatgctgagggaggtgacgccg 

CCTCTCCTACTACAAGCCCATCATGTCTCGCAGCT.TG 252! 



24 61 gagctgtcgtgccccgtggtgctc^ 

ELSCPVVLLS^XKPi" 

GCCGAGATGAAAGAGGCGGGGGTCCACGGTAACTATAGCTAGCTCTTCCGATCCCCCTTC 

GLIVPDl.PYVAAHSL.wo 
2701 AACAACAACCTGGAGCTGGTAGGTTGAATTAAGTTGATGCATGTGATGATTTATGTAGCT 
N N N L E L 

27 61 AGATCGAGCTAGCTATAATTAGGAGCATATCAGGTGCTGCTGACAACACCAGCCATACCA 
2 621 GAAGACAGGATGJ^GGAGATCACCJAGGCTTCAGAAGGCTTCGTCTACCTGGTAGTTATA 

l«i S"S^«=SSSSS 3000 

3001 gtgagcgtgaacg^gacaggicctcgcgcaaacgtoaacccacga™ 
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f 3061 ATCCAGGAGGTTAAGAAGGTGACTAACAAGC^C 

3121 CCCGAGCACGTGAAGCAGGTACGTACGTAGCTGACCAAAAAAAACTGTTAACAAGTTTTG 
* 31 8 1 TTTGACAAGCCGGCTACTAGCTAGCTAACAGTGATCAGTGACACACACACACACACAGAT 

32 41 TGCGCAGTGGGGCGCTGACGGGGTGATCATCGGCAGCGCCATGGTGAGGCAGCTGGGCGA 
AQWGADGVI I GSAMVRQ L^^ 

3301 AGCGGCTTCTCCCAAGCAAGGCCTGAGGAGGCTGGAGGAGTATGCCAGGGGCATGAAGAA 
AASPKQGLRRLEEXAi^o 



A L P AAAAATAAGCCAAGTGGAAGTGAAGCTTAGCT 



3 3 61 CGCGCTGCCATGAGTCCATGAC 
ALP 

34 21 AT CATC T C GGAG AAG AC G AC CG AC C AA T 

TTCCGG. 

;TGGAGCTACTAGCTAGCTGGCCGTTC 



3601 ATTCGATTATAATTCTTCGCTCTGCTGTGGTAGCAGATGTACCTAGTCGATCTTGTACGA 

3 6 61 CGAAGAAGCTGGCTAGCTAGCCGTCTCGATCGTATATGTACTGATTAATCTGCAGATTGJ1 

3721 ATAAAAACTACAGTACGCATATGATGCGTACGTACGTGTGTATAGTTTGTGCTCAT 

Till ifM^CACC^ 
3841 



4681 



3120 

3180 
3240 

3300 

3360 



+++ AAAG T AAAAC G T AC AGAGAC AC T T GAT AAT ATCT AT C T 3 4 20 



3481 GTATATACACCGTACGTCGTCGTCGTCGTTCC ~» 
3 5 4 1 AACGTGTACGTAGTAGTATGTAATGCATGGAGTG 



3480 

3540 
360: 
3660 

3720 



3780 
3840 

3900 
396C 



CCTTCTTT G AC AG AC AC AC C AC C AC C AG C AGCAG T G AC r A r AAA T C T GG AGC AC T G GG 

3 9 01 GACATGT AAC^T AT TTTAAGAGGT AT AAGAT AC^ 4 020 

3 9 61 AT AT TGCAAAGACAAAAAAAAAACAAAAT T^ C TGCT CAAAT AGATCGAT A 4 0 80 

4 021 GG T GGATG GAT TAAAAT TAT CT 

4 0 81 TTCAGATCTATGTTGTATAGCCTTTTCATTAGCTU l u " ™?CGTACTTGTTTGTGGCCC 4 20 0 

till ^ssssssssssssssssss^^^»«sss. 'Ill 



4 321 AACTGGCTAGGACTCGCCCATTTAATAAGAt-AAA<«-«.Ji ^^^"JvCl^maTiTTATTGTATA 4 4 40 

J 3 8 1 ATTTAAAATATATTGTTTATTTGTGAAATGTGTATTATTTGTAATATA J J ^ 

4 441 TAG 

4 501 TTG 

4561 GGG 

4 62 1 S£SE3££S5£^ 



«« "SgS^SSSJ^gg^^SSSg^c*™* .... 



4740 
4800 



4 7 41 GTGGTGTGATTAGAGATGGCGATGGGAACCGATCACTGATTCCGTGTGGAGAATTCGATA ^ 
4 801 TCAAGCTTATCGATACC 

Figure 24 . 

Entire sequence of the maize J^^J^/ttrat^'sJart and 
and exons, transcription and translation 

stop of cDNA. . . . x. j. . 73******* = 

S /start and end of cDNA; *1 - t rans c«ptxon jt.rt ^73 = 

primer extensxon pr-J^ t J£?b£ poly A addition site. 
# above underlined sequences are PCR primers. 
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2110 5N984 

60d 70^ 

P C BR ES LP MP UP C BR ES LP MP UP 




21 1D 5N984 

— — K " " 1 

39d 46d 

, ;i ' 

S L R P SH LP UP LP MP UP 




Northern blot showing differential expression of 
TrpA gene in maize tissues. 2 hour exposure against 
film at -80C with :opcnt Cronex intensifying screens 
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FIG. 25B 



211D 5N9S4 



60d 70d 



P C BR ES LP MP UP C BR ES LP MP UP 




2 -i -id 5N984 

3 9d 46d 

l II " i 

S L R P SH LP UP LP MP UP 






r 

3 



r. 



Northern bio- shewing differential expression of 
TrpA gene in maize tissues. 4 hour exposure against 
film at -80C with Dupont Cronex intensifying screens 
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FIG. 25C 



2110 5N9B4 

60d 70d 

P C BR ES LP MP UP C BR ES LP MP UP 



211D 5N984 

^^^^^^^^^ 

3 9d 46d 



i ii 

S L R P SH LP UP LP MP UP 




Northern blot showino differential expression of 
"III gene in maize tissues. 18 hour exposure against 
film It -8 0C with Dupont Cronex intensifying screens. 
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FIG. 25D 



21 iD 



5N984 



60d 



70d 
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• * 



0 



r 



2 1 1D 



5N984 



-, r 



39d 



46d 



I il 

S L R P SH LP UP LP MP UP 




Northern blct shewing differential expression of 
TrpA c«ne in mai-e -issues. 48 hour exposure against 
film at -BOC with Cupont Cronex intensifying screens 
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2 1 1 D 5N984 
L P L P 



211 D 
S1 S2 S3 S4" S5 





Funk lines ^11D a ^a - °^ total RNA . 

of expression in -t.-D s. a t--g C C with Dupont Cronex 

65 hour exposure against a- 

intensifying screens. 
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10X 5X 1X S H EV E B 



9.4- 
4.3- 



2.0 



1.2 — 



<Z3 



. . „ _ * p^nw s n e 2 1 ID probed with the 
Genomic southern c_ Funk ^ne z o t - F rnRV 

TrpA cDNA 8-2. B = BamHI, E = EcoRI, EV - EcoRV, 
h = Hindlll and S = SacI . 

120 nour exposure against film at-80C with Dupont Crone, 
intensifying screens. 
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T 
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C 
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1 



Primer extension showing the transcription start of TrpA 

gene and sequencing ladder 80C witn Dupon t Cronex 

1 hour exposure against film at buu wi^n F 
intensifying screens. 
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RNase protection of region from -2 bp to -33" bp 
with three annealing temperures . 

?6 hour exposure against film at-BOC with Dupont Crone 
intensifying screens . 
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EcoRI 



1.0 kb 



EcoRI 




EcoRI 
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Maize Pollen CDPK cDNA sequence 

sequence contained in clones pCIB3168 and pCIB3169 



, iTr ATG CAC CAC CTC TCC GGC CAG CCC AAC GTG GTG GGC CTC CGC GGC SCG 

1 7>G^ ^ IK hTs STs Ser G.y Gin Pro Asn Va. Va. G. y Leu A rg Gl y 

_„ „ iTr ere cag Arc CAC ACC TGC CAC TCC ATG GGG GTG ATG CAC C3G GAC 

^►X^ A?; ^ S? G^n ^.S? His Thr Cys His Ser Me. Gl y Va I Me, H, s Arg Asp 

228 ATC AAG CCC GAC AAC TTC CTG CTG CTC ACC AAG GAC GAG GAC GCG CCC CTC AAG « 

-.hie Lys Pro Gl u Asn Phe Leu Leu Leu Ser uys Asp Gl u Asp Ala rro u y 

, B ' r- ^ GGC CTC TCC CTC TK TTC AAG SAG GGC GAG CTG CTC AGG GAC ATC GTC 

2 ll>TU< Sp Se G^ L^ £r Z Phe Phe Lys Gl u G.y Gl u Leu Leu Arg Asp Me Va, 

684 CTC AAG CAG TTC AGG GCC ATG AAC CAG TTC AAG AAA GCA GCA TTG AGG ATC ATA GOT 
228>Leu Lys Gin Phe Arg Ala Met Asn Gin Phe Lys Lys Ala Ala Leu Arg 

_ „„ j.-, j, rJ > Gen ctg AAG GAG ATG TTC AAG AAC ATT GAC 

741 GGG TGC CTA TCC GAA GAG GAG ATC ACA GGG CTG AAU Ajn ||# A sp 

247>Gly Cys Leu Ser Gl u Gl u Gl u He Thr Gly Leu Lys wu rve. y 

r x r rrc AAA CAC GGG TTG GCA AAG CAC 

zz^r^^^zzr.z^ «. - a,. «. 

855 .00 CCC MO c» ~ «c « « »» « - - ™ » ^ Asp K 2S 

285>Gly Pro Lys Leu Ser Asp Ser Gl u Met Gl u uys lbu rvw 



EcoRl 



912 GGC AAC GGG TTA ATT GAC TAC GAC GAA TTC CTC ACC GCA ACA GTC CAT ATC AAC AAA 
304>Gly Asn Gly Leu lie Asp Tyr Asp Gl u Phe Va I Thr Ala 
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Flo. j0 l C v_/' N T ) 

r , T AGA GAA G/\G CAC CTT TAC ACA GCA TTC CAG TAT Tlv. GAC AAG GAC AAC AGC 
^Leu" Z K SI *u His Leu Tyr Thr A. a Ph. G. n Tyr Phe Asp Lys Asp Asn Ser 

1n ,, (TfiC TAC ATT ACT AAA GAA GAG CTT GAG CAC GCC TTG AAG GAG CAA GGG TTG TAT GAC 
^ f" Lys Glu Glu Leu Gl u His A. a Leu Lys 0u Gin Gl y Leu Tyr Asp 

lOR* GC- GAT AAA ATC AAA GAC ATC ATC TCC GAT GCC GAC TCT GAC AAT GAT GGA AGG ATA 
Tl^s ?S L^^U L^Asp lie He Ser Asp Ala Asp Ser Asp Asn Asp G. y Arg lie 

nan ga- TAT tca gag ttt gtg gcg ATG ATG AGG AAA GGG acg gct GGT GCC GAG CCA ATG 
38u»£ P ™ 5? To VaT Ala Me. Me, Arg Lys G, y Thr Ala Gl y A.a Glu Pro Ma, 

1197 AAC ATC AAG AAG AGG CGA GAC ATA GTC CTA TAG TGAAGTGAAGCAGCAAGTGTGTAATGTAATGTG 
399>Asn Me Lys Lys Arg Arg Asp lie Va I Leu • • • 

0 

1340 AAAAAAAAAA 
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FIG - 31 



31 



inner leaf sheath 



inner leaf whorl 



green leaf 



anther 




pollen 



silk 



kernel 



pith 



root 



a 
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Similarity 
Index 



Gap 
Number 



Gap Consensus 

L «onth b 



Lipman-Pearson Protein Alignment 
Gap Penalty: 2, Gap Length Penalty: 12 
Seq1 Seq2 
pcH CDPK ptn rat 
1>S51 1>528 

po. CDPK ptn YSMGKELGRGQFGVTHLCTHRTSGEKLACKT I * * ^ J' J~ * EOVpDVRREVQ I HHHL^GQPNVVGLRGAYE '« 
.o, P*2 ptn YQLFEELGKGAFSVVRRCVKKTSTCEYAAK I I NTKKLSARDH-GKLEREAR I CRLLK-HPN I VRLHOSI S 8! 

po I CDPK ptn DKQSVHLVMELCAGGELFDR ! ^0**" y*^ E^** ^" j^? ' v^'^^H^'^P^PhRO:' KPEN^ L L '"SK^^P A ™ 
rot pk2 ptn EEGFHYLVFDL VTGGELFED I VARE YYSEADASHC 1 HQ I LeSVNH I HQHD I VHRDLKPENLLLASKCKGA IS- 

PC COPK ptn PUATDFC, LSVFPKEGELLR-0 1 VCSAYY UPEV, ^<-^^^\^V^^ «■ 
,ot p.2 Ptn "avKlXdFGL'a'i EVOGE'oQAWf'g'f'agV'pGyL'sPEVLRKDPYGKPVD I WACGV I LY I LLVGYPPFWDEDQHK 22" 

p,, COPK ptn ! FTA I LRGQLDLSSEPWPHI SPGAKOLVKKMLN \ ^p*^!"^* F q^-^hPV:' ^ P^^^*^"^ ^ LDN VV-1_ DRL 370 
, at D<2 ptn ■■?n 0 \ K ^ 0 OF^ 29 ' 



G 0: : S V : : P. A<: L: : : Ml 
1V0QUAGAYDFPSPEW0TYTPEA<NL INQML T 



□0 ZZ?< p;n <GFRAHNCF<KAALRM 387 

: : F. A. . : : K A L . : 
— n<2 n-- R<-NARR<.<GA:.-'M 308 
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UpmarvPearson Protein Alignment 
Gap Penalty: 2. Gap Length Penatty: 12 
Seq1 Seq2 
pol CDPK ptn humcama ptn 

1>551 1>1S0 



Similarity 
index 



Gap 
Number 



40.3 



Gap 
Length 



Consensus 
Length 



142 



pol CDPK ptr 



numcoma ptr\ 



LSEEEITGLKEMFKNI DKDNSGT I TLOELKHGL AKHGPKLSDSEMEKLME AADADCNGL I OYDEFVT A7V 46C 
i • EE* I* ' KE F : DKD. . GTIT . EL : . G. : : : E: : : : . , OADGNG 10: EF: T 

lteeqiaefkeafslfdkdgdgtittkelgtyhrsi gqnp TEAELQOM [ nevoaogngt I ofpefltmma 7a 



do! CDPK o;n H-MNKLDREEHL YTAFOYFDKDNSGY 1 TKEELEHAL<ECG -L YDADK I KD I I SDADSDNDGR ) DYSEF VA 528 

■ M: . 0. EE: : . AF: FDKO. :GYI: . EL H. : . : G I. : AO 0. DC: :: Y. EFV. 

'rkMKDTDSEEEIREAFRVFDKOGNGY I SAAELRHVMTNLGEKLTDEEVDEMIREAOIDGOGQVNYEEFVQ 14A 



humcoma ptn 



pol CDPK ptn MM 530 

MM 

humcomo ptr» MM I 46 



83/94 



WO 93/07278 

FIG. 34 



PCT/US92/08476 



Llpman-Pearson Protein Alignment 

Gap Penalty. 2, Gap Le^Penalty: 12 Gap Gap Consensus 

poTcDPK ptn soybean CDPK ptn Index Number Length Leng^ 

1 >509 62.4 1 



1>551 



pot CDPK otn 



soybeon CDPK ptn VLPQRT QN ! RE Y Y E VG 



YLGRPKEDVRATYSHGKELGRCOrGVTHLCT^ "«> 
11 L : _LL_.. : . ^ : ..:^..i RI(L GaGQFGTTFECTRRASGGKFACKS I PKRKLLCKEDYEDVVRE i 0 ! MHHLSE 9 ■ 



po> CDP'< ptn 



OPNYVGLRGAYEDKQSVHLVMELCAGGELFDR I I ARGQYTERGAAELL RA JY^'y^^^^Y 
• •ill* • I • lit' • I I I II M I : I M M I II : : : I : M : M : I : : M . 1:111 M 



H1HRDKP 220 
(III I I 



soybean CDPK ptn HANVVR ) EGT YEDSTAVHLVMELCEGGELFDR I VQKGHYSEROAARL I KT I V£ VVEACHSLGYMHRDLKP 61 

ENFLLLSKDEDAPLKATDFGLSVFFKEGELl RD ^ Y??^TT ^ Y V ? T^^? ^V^^*?Y^VT I ^V^^iY *° 



pol CDPK ptn i i i i i -ii i I | I I II I I * I II -I III: II Mill::. MM: 1 I 1:11: Ml: 1:11 

ENFLFDT 1 DEDAKLK ATDTGLSYFY ! I Y I LLSGV 23' 



soybeon ZZ°< ptn 



doi CDPK Ptn 



soyb.on C0P< ptn MESE*I KOLMDAADl Dk'sGT I DYGEFI AATVHLNKUEREENLVSAFSYFDKDGSGY I TLOE I QCACKOFGL 
pp. C3P< Ptn YDAOK 1 KD I I SOACSDNOGR I D^SEF"VAMMRKGTAGAE^MN^1 << 

s;rteo - ::=< c-.- - ==: - : ~« : <e : :~1:g= : ^be* aa--«sv;s'. gVr-'«r< w 



500 
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FIG. 35 



po\ CDPK gene Map (1 > 4165) Srte and Sequence 

Enzymes : 6 of 198 enzymes (Filtered) 

Settinqs ■ Circular, Certain Sites Onhv. Stand ard Genetic Code 

tt a pi- ' at at r Tr Trr &ATCGCTTGGGTTGGCACATTCTTAGCTTTT ATCACATT TT AAGAAAT AGAGTTCACC ACCTTC 



80 



AAAATATGCCT ATACAATGAATGATGCTTGGATGCAATATAGCTAGATTCAACT AGCTAT ATATGGTCAATAGAACCCTG 



60 



4 A^r Arr ttat aa at Arc Ar.TTrAATTTTGAGACCCTAAGCGAGTAAATGGTTAAAGTCCTCTTATTATTA-GTCTTAGG 



240 



AC 



TTPTPCTTGCTAAATGCTTGTCAGCGATCTATATATCTTCCCCACTGCGGGAGAT ACTATAT ATAGGGCCTTGGACCT 



320 



t Affect Atr TP A A ARGCCTAGTCAC AACAATTCTCAAC AGT ATTT AATTTT AT ACATGTATGAACAGTGTAGGAATTTG 



AG 



tcppp A APPP. AARAGTGGGAGGTGT AAATTGGGT AGCTAAACTTAAAT AGGGCTCTTCTT ATTTAGGTTTATCTAGTC 



400 



480 



TP T APTT ARACT A ATTCAG AAAGAATTTT ACAACCT ATGGTTAATC AT ATCTCT AGTCT AAGCAAATTT AGGAAAGTT AA 



AAGC 



ap ap A ATT AGGCACATGTGAAAGATGTGT ATGGTAAGT AAAAG ACTTAT A AGG AAAAAGTGGGTG AATCCTCAAGA 



560 



640 



TGTGGTGGTATATCCCAATGATATTAGATGCCAGAATATA GGGGGGAAATCGATGTATACCATC TCTACCAGGATACCTG 

■ . ■ - » ' ■ ■ -■ ' ' i i . ■ ■ * ■ - ■ ■ 1 ■ * *- 



TGC 



r*n aptctrp A APTRAP ACATGG ACC ATGGTGTCTTCTT AGATTTGGTTATT AGCTAATTGCGCTAC AACTTGTTCAA 



800 



RRP-TAGACCAAATTAAAAAACTAATATT AAACATAAAAAGTTAGGCAAACTATAGTAAATTATGC AGCGATCCAACAACA 



880 



AGCC 



, TnT . T ^T^rTrATRARPPAPRPRTPGGCCATACACCCACATGATGTTTCCATACGGATGGTCCTTATGCAATT 



TTGTCTGCAAAA 



p ap A ARPPTTAAT AC AGCC ACGCGAC AATCATGGAAGTGGTCGTTTT AGGTCCTC ATCATGAAGTTC A 



960 



1040 



RGAAAAPRC ATC AAATGTAATGCAGAGAAATGGTATTTCT TCTCTTGTAAATC AGGGAGAGGAGTACCATCAGTAC AGA ^ 



EcoRI 



' ttp ARAATPARAATTC AGTCTTCCAACGAC AATAATCGCAGCATCT TGT AAAAATTTGCAGAAACTTCTGTTTGACTTGT 



200 



AGCCCTGACCTTTGCAAATATTTGAAGTTGTGCCTGCTGACACAACTTCAATCTGGAAG TGCTGTTGATCAGTTTTGCCA ^ 
RAAACAGCAAGCAGCCTATAT ATAT CTGTCACGAGACACCCTGCCGCCCTCTTCTTTCCCGCCATTCCCTCCCTACCCT T 

■ ■ " — — ■ — ■ 
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Site and Sequence 



riCPPKqcne Map (1 >4165] 
r a a a * TrT ac a a ArrTTTTTTTTTCCTCCCGAT ACGCCCCTC CAT CTCTCGCCGTTCATQT CCGT'GGCTGGCTGCCCTCC ^ 




l m RNAstart J 



^TnrzcAcrAnnrRGCCGCACTCGTTCCCCGCCGC AGCCATGGGCCAGTGCTGCTCCAAGGG CGCCGGAGAGGCCCCGCCA ^ 
. - i i » ■ ■ — . ■ i ... i ' 11 



EXON 1 



rrgARKfRr-CAAACGGCAGGCGCCAAGCCGCGGGCGTCCGCGAAC AACGCCGACGGACAACGGGCGTCGTCCTCGTCCBC 




f^r Ti^i^r TrrTrrr r;r Tnr mr TRrrRRTGGTGGTGGCGGCGGCACGACGAAGCCGGCCTCACCCACCGGCGGCGCCAGGG 




CCAGC 



Trrticr acp A A ArrRGCGCCCGCCGTGGGCACGGTGCTGGGCCGGCCCATGGAGGACGTGCGCGCGACGTACTCG 




Ava I 



ATGGCCAAGGAGCTCGGGCGCGGGCAGT TCGGCGTGACGCACCTGTGCACGC ACCGGACGAGC GGCGAGAAGCTGGCGTG ^ 

i . ... - - I ■ . ■ ■ \ ■ - ' ' — . 




EXON 1 



CAAGACGATCGC 



n a AnrgcA ARPTRRrGGCCAGGGAGGACGTGG ACGACGTGCGGCGGGA6GTGC AG ATCATGCACC ACC 




TrTrrrnrr Ar?r rr A ArcTnGTGGGCCTCCGCGGCGCGTACGAGGAC AAGCAGAGCGTGC ACCTCGTCATGGAGCTGTGC ^ 




EXON 1 



Ava I 



' GCGGGCGGGGAGCTC 



TTCG * rr ^ r A Tr ATr GrrTGGGGCC AGT ACACGGAGCGCGGCGCCGCGGAGCTGCTGCGCGCCAT 




2080 
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>l CDPK pane Map (1 > 4165] 



FIG . 35 (CONT) 
Site and Sequence 



Ava I 



CBTBrAGATCGTGCACACCTGCCACTCCATGGGGG TGATGCACCGGGACATCAAGCCCGAGAACTTCCTGCTGCTCAGCA 

- ' ■ ■ . t ■ < * ' ' 1 ' " 




EXON 1 



Aftc Arc ARB at Rrfirr.Rr.TrAAGGCCACCGACTTCGQCCTCTCCGTCTTCTTCAAGGAGGGCGAGCTGCTCAGGGAC ATC 




2240 



EXON 1 



Ava I 



CTrGRf AftrftrrTACTACATCGCGCCCGAGGTGCTCAAGAGGAAGTACGGCCCGGAGGCCGACATCTGGAGCGTCGGCGT 




2320 



.Bam H I 

r ATcrTrTAr ATrTTCCTCGCCGGCGTGCCTCCCTTCTGGGCAGGTCGGATCCGTCCGTGTTCGTCCTAGACGATATACA 




2400 



EXON 1 



GAACCCGACGATGGATTTGCTTCTCAGCCCTGTTCT TGCATCACCAGAGAACGAGAACGGCATCT TCACCGCCATCCTGC 

t . > • - 4 - - l... .i ■ ■ I i ■ ■ mitt ■ ■ » ^ ' 



INTRON 1 



EXON 2 



r:Af5P.crAfirTTCACCTCTCCAGCGAGCCATGGCCACACATCTCGCCGGGAGCCAAGGATCTCGTCAAGAAGATGCTCAAC 




2560 



ATCAACCCCAAGGAGCGGCTC 



ACGGCGTTCCAGGTCCTCAGTAAGTACCCAGATCGTTGCTGTCATAC 



ACTCATATGAAT 




2640 



EXON 2 



trt ATTRTTC ATGAGCAACGATCGAGCGGATTTGGTGAACTTGT^GATCACCC ATGGATC AAA6AAGACGGAGACGCGCC 



2720 



INTRON 2 



EXON 3 



tga'cacgccgcttgacaacgttgttctcgaca ggctcaagcagttcagggccatgaaccagttcaagaaagcagcattga 2eoo 




EXON 3 
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mJCDPKoene Map M > 4165) Site and S equence 

prr-T at at^ atp TRA TAAAAnCTCCACAAATACAACTT CTGAAGAACAGCAATGCTTACA C GGCAGAATTTTCATTATAA ^ 




i ^hu i ^.j..^ p * .i^vJimwi-Hwarg 



INTRON 3 



ATGCTCTT6ATGACATAATGTTAGATCATAGCTGGGTGCCT ATCCGAAGAGGAGATCACAGGGCTGAAGGAGATGTTCAA 




INTRON 3 



GAACA 



TTr*r AAr>r?ATAAr AErrcnrc apt ATT ACCCTCGACGAGCTCAAACACGGGTTGGCAAAGCACGGGCCCAAGCTGT 




3040 



CAG 



ACAGCGAAATGGAGAAACT AATGGAAGC AGTGAGTTTTC AGAGT ACAATCTT AAAAAAAGGAATTGTGATTCTTTTC ^ 



EXON 4 



INTRON 4 



AAAATGAAGAAGTAATCTGAAAACATC CCTGCTGAAATGCTTTAT ACATTTCCAGGCTGA CGCTGACGGCAACGGGTTAA ^ 



INTRON 4 



EXON 5 



EcoR I 



TTGACTACGACGAA 



■^TTrnTrArrcrAArARTCrATATGAACAAACTGGATAGAGAAGAGCACCTTTACACAGCATTCCAG 




3280 



EXON 5 



EcoR I 



TATTTrn at a acrap AAr.ACCGGGT AAGTTGAACGTTAAAATG AT AC AGCTGGT ACCTg'aATTCTGGAC AAC ACATATCA 




EXON 5 



r * *r *nn a r at at atat a ATTTRTTTATCTCAC AGGTACATTACT AAAGAAGAGCTTG AGCA CGCCTTGAAGGAGCAAGG ^ 




INTRON 5 



T GTTGTAT 



^ArnrrrATAA a a TP A A AftAP ATC ATCTCCGATGCCGACTCT G AC AATGT A AGG AAC AAAC ATTATTT AAATT ^ 




EXON 6 
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jlCPPKqgne Mao (1 >4165) Site and Sequence 

TCAGCCGACAAACTAAACTATAGAAACCACATC ATGATATCAAATTTTGAGGTGGCGGTGCTACAGAAATAGAACCCAGT 



INTRON 6 



ACACCAAAATGACTAACTTGTCATGATTAGTTGTTCCTCGTAACTGAACATTTGTGTTCTTAG'TTTCTTATTGTTAAACC 



INTRON 6 



AAAGACTTAAATTCACTTTTGCACATGCAGGATGGAAGGATAGATTATTCAGAGTTTGTGGCGATGATGAGGAAA6GGAC ^ 




INTRON 6 



EXON 7 



CSGCTSGTSCCCAGCCAATGAACATCAAGAAGAGGCGAGAC ATAGTCCT A TAGTGAAGTGAAGCAGWAAGTGTGTAATGTA 

. | i 1 - - - 1 ■ - - — *■ ■ — '**-■'' ' OO"? \f 



EXON 7 



ATGTGTATAGCAGCTCAAACAAGCAAATTTGTACATCTGTACACAAATGCAATGGGGTTACTTTTGCAACTTAGTTCATG 



GATCGTTCTCTACGTTGTGCTATTGATTGCAAGTGATTTGAAAGACATGCATACTTAGGAACTGAGAAAGATAGATCTAC 



4000 




4080 




AGGCC 



4165 
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Fig. 3/ 

E:\GF2-PAT.SEQ 9/21/92 

NAME : PCIB5515 7415 BPS DMA CIRCULAR UPDATED 9/21/92 

DESCRIPTION: pGFl with synthetic geiaer fix. 



*** SEQUENCE 



* * * 



61 



ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
MetAapAan AanProAan IleAanGluCya IleProTyr AsnCyaLeu SerAanProGlu 

GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGiyTyr ThrProIle AapIleSerLeu 

191 AfiCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
121 S«£euThr GlnPhSefLeuSerGiuPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAapIle IleTrpGly IlePheGlyPro SerGlnTrp AapAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu IleAanGln ArglleGluGlu PheAlaArg AanGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC G^CGAGAGCT TCCGCGAGTG GGAGGCCGAC 
GluGlyLeu SerAanLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaAap 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TC^CGACAT GAACAGCGCC 
ProThrAan ProAlaLeu ArgGluGluMet ArglleGln PheAanAap MetAanSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCI GCTGAGCGTG 
LeuThrThr AlallePro LeuPheAlaVal GlnAanTyr GlnValPro LeuLeuSerVal 

TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
TyrValGln AlaAlaAan LeuHiaLeuSer ValLeuArg AapValSe* ValPheGlyGln 

CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAapAla AlaThcIleAan SerArgTyr AanAapLeu ThrArgLeuIU 

GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGT 
GlyAanTyr ThrAapHia AlaValArgTrp TyrAanThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC CAGTTCCGCC ^G^fTaC CCTGACCGTG 
ProAapSer ArgAapTrp IleArgTyrAan GlnPheArg ArgGluLeu ThrLeuThrVal 

CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
LeuAapIle ValSerLeu PheProAanTyr AspSerArg ThrTyrPro IleArgThrVal 

AGCCAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
SerGlnLeu ThrArgGlu IleTyrThrAan ProValLeu GluAanPhe AapGlySerPhe 

CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHxaLeu MetAapIleLeu 

AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
A^nS«Iie ThrlleTyr ThrAapAlaHia ArgGlyGlu TyrTyrTrp SerGlyHiaGln 

ATCATGGCCA GCCCCGTCGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
IxSeSa^erProVaX GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 

.TOVfhJ.rr CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
A^Pro^l^nA^glle ValAlaGln LeuGlyGln GlyValTyrA.g 

r - GAGCA gcaccctGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
T^rSef Se^nrLeu TyrArgArgPro PheAanlle GlylleAan AanGlnGlnleu 

AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CA^C^^TG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAanLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLya SerGlyThr ValAapSerLeu AapGluIle ProProGln AanAsnAanVal 
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1261 
1321 
1381 
1441 
1501 
1561 
1621 
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2461 



S3S?SH5?LSS2Sa EES? 

COCCOCMC* GCCCOKCCJ SIS G»S 2«^Sf .iSISS^ S^SSS 
ArgArgThr SerProGly GlnlleSerThr LeuArgvax 

„ , rrv , rrl .. rT rCAATCGGAA AATGTGGGGA GCCGAATCGA 
GCTTCCTTAT GQCCGCTTTC AGCCCCAAGT ^^ AA LysCyaGly GluProAsnArg 
GlySerLcu TrpProLeu 5erAiarc«o C 
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2 641 AGAGCGGAGA AAAAATGGAG AGACAAACGT GAAAAATTGG AATGGGAAAC AAATATTGTT 
ArgAlaGlu LyaLyaTrp ArgAapLyaArg GluLysLeu GluTrpGlu ThrAanlleVal 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCT TTATTTGTAA ACTCTCAATA TGATAGATTA 
TyrLysGlu AlaLyaGlu SerValAapAla LeuPheVal AsnSerGln TyrAapArgLeu 

27 61 CAAGCGGATA CCAACATCGC GATGATTCAT GCGGCAGATA AACGCGTTCA TAGCATTCGA 
GlnAlaAap ThrAanlle AlaMetlleHia AlaAlaAsp LyaArgVal HisSerlleArg 

2821 GAAGCTTATC TGCCTGAGCT GTCTGTGATT CCGGGTGTCA ATGCGGCTAT TTTTGAAGAA 
GluAlaTyr LeuProGlu LeuSerValUe ProGlyVal AanAlaAla IlePheGluGlu 

2881 TTAGAAGGGC GTATTTTCAC TGCATTCTCC CTATATGATG CGAGAAATGT CATTAAAAAT 
LeuGluGly ArgllePhe ThrAlaPheSer LeuTyrAsp AlaArgAan VallleLysAsn 

2 941 GGTGATTTTA ATAATGGCTT ATCCTGCTGG AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
GlyAspPhe AsnAsnGly LeuSerCyaTrp AanValLya GlyHisVal AapValGluGlu 

3001 CAAAACAACC ACCGTTCGGT CCTTGTTGTT CCGGAATGGG AAGCAGAAGT GTCACAAGAA 
GlnAanAan HiaArgSer ValLeuValVal ProGluTrp GluAlaGlu ValSerGlnGlu 

30 61 GTTCGTGTCT GTCCGGGTCG TGGCTATATC CTTCGTGTCA CAGCGTACAA GGAGGGATAT 
ValArgVal CysProGly ArgGlyTyrlle LeuArgVal ThrAlaTyr LysGluGlyTyr 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGly CyaValThr IleHisGluIle GluAanAan ThrAapGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA TTATACTGCG 
AsnCysVal GluGluGlu ValTyrProAsn AsnThrVal ThrCyaAan AapTyrThrAla 

32 41 ACTCAAGAAG AATATGAGGG TACGTACACT TCTCGTAATC G A GG AT AT GA CGGAGCCTAT 
ThrGlnGlu GluTyrGlu GlyThrTy rThr SerArgAan ArgGlyTyr AspGlyAlaTyr 

3301 GAAAGCAATT CTTCTGTACC AGCTGATTAT GCATCAGCCT ATGAAGAAAA AGCATATACA 
GluSerAsn SerSerVal ProAlaAapTyr AlaSerAla TyrGluGlu LysAlaTyrThr 

3361 GATGGACGAA GAGACAATCC TTGTGAATCT AACAGAGGAT ATGGGGATTA CACACCACTA 
AapGlyArg ArgAapAan ProCyaGluSer AanArgGly TyrGlyAap TyrThrProLeu 

34 21 CCAGCTGGCT ATGTGACAAA AGAATTAGAG TACTTCCCAG AAACCGATAA GGTATGGATT 
ProA laGly TyrValThr LysGluLeuGlu TyrPhePro GluThrAap LyaValTrpIle 

34 81 GAGATCGGAG AAACGGAAGG AACATTCATC GTGGACAGCG TGGAATTACT TCTTATGGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAapSer ValGluLeu LeuLeuMetGlu 

3541 GAATAA 
Glu 
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